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ISOLATED HUMAN TRANSPORTER PROTEINS, NUCLEIC ACID MOLECULES 
ENCODING HUMAN TRANSPORTER PROTEINS, AND USES THEREOF 



RELATED APPLICATIONS 

5 The present application claims priority to provisional application U.S. Serial No. 

60/240,836, filed October 17, 2000 (Atty. Docket CL000891-PROV) and 09/804,474, filed 
March 13, 2001(Atty. Docket CL000891). 

FIELD OF THE INVENTION 

The present invention is in the field of transporter proteins that are related to the . 
1 0 sodium/calcium exchanger subfamily, recombinant DNA molecules, and protein production. 
The present invention specifically provides novel peptides and proteins that effect ligand 
transport and nucleic acid molecules encoding such peptide and protein molecules, all of which 
are useful in the development of human therapeutics and diagnostic compositions and methods. 

15 BACKGROUND OF THE INVENTION 

Transporters 

Transporter proteins regulate many different functions of a cell, including cell 
proliferation, differentiation, and signaling processes, by regulating the flow of molecules such 
as ions and macromolecules, into and out of cells. Transporters are found in the plasma 

20 membranes of virtually every cell in eukaryotic organisms. Transporters mediate a variety of 
cellular functions including regulation of membrane potentials and absorption and secretion of 
molecules and ion across cell membranes. When present in intracellular membranes of the Golgi 
apparatus and endocytic vesicles, transporters, such as chloride channels, also regulate organelle 
pH. For a review, see Greger, R. (1988) Annu. Rev. Physiol. 50:1 1 1-122. 

25 Transporters are generally classified by structure and the type of mode of action. In 

addition, transporters are sometimes classified by the molecule type that is transported, for 
example, sugar transporters, chlorine channels, potassium channels, etc. There may be many 
classes of channels for transporting a single type of molecule (a detailed review of channel types 
can be found at Alexander, S.P.H. and J. A. Peters: Receptor and transporter nomenclature 
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supplement. Trends Pharmacol. Sci., Elsevier, pp. 65-68 (1997) and http.7/www- "' " " 

biolo&v.ucsd.edu/~msaier/transport/titlepage2.html . 

The following general classification scheme is known in the art and is followed in the 
present discoveries. 

5 Channel-type transporters. Transmembrane channel proteins of this class are ubiquitously 

found in the membranes of all types of organisms from bacteria to higher eukaryotes. Transport 
systems of this type catalyze facilitated diffusion (by an energy-independent process) by passage 
through a transmembrane aqueous pore or channel without evidence for a carrier-mediated 
mechanism. These channel proteins usually consist largely of a-helical spanners, although b- 

1 0 strands may also be present and may even comprise the channel. However, outer membrane 
porin-type channel proteins are excluded from this class and are instead included in class 9. 

Carrier-type transporters. Transport systems are included in this class if they utilize a 
carrier-mediated process to catalyze uniport (a single species is transported by facilitated 
diffusion), antiport (two or more species are transported in opposite directions in a tightly 

1 5 coupled process, not coupled to a direct form of energy other than chemiosmotic energy) and/or 
symport (two or more species arc transported together in the same direction in a tightly coupled 
process, not coupled to a direct form of energy other than chemiosmotic energy). 

Pyrophosphate bond hydrolysis-driven active transporters. Transport systems are 
included in this class if they hydrolyze pyrophosphate or the terminal pyrophosphate bond in 

20 ATP or another nucleoside triphosphate to drive the active uptake and/or extrusion of a solute or 
solutes. The transport protein may or may not be transiently phosphorylated, but the substrate is 
not phosphorylated. 

PEP-dependent, phosphoryl transfer-driven group translocators. Transport systems of the 
bacterial phosphoeno!pyruvate:sugar phosphotransferase system are included in this class. The 
25 product of the reaction, derived from extracellular sugar, is a cytoplasmic sugar-phosphate. 

Decarboxylation-driven active transporters. Transport systems that drive solute (e.g., ion) 
uptake or extrusion by decarboxylation of a cytoplasmic substrate are included in this class. 

Oxidoreduction-driven active transporters. Transport systems that drive transport of a 
solute (e.g., an ion) energized by the flow of electrons from a reduced substrate to an oxidized 
30 substrate are included in this class. 

Light-driven active transporters. Transport systems that utilize light energy to drive 
transport of a solute (e.g., an ion) are included in this class. 
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Mechanically-driven active transporters. Transport systems are included in this class if " * 
they drive movement of a cell or organelle by allowing the flow of ions (or other solutes) 
through the membrane down their electrochemical gradients. 

Outer-membrane porins (of b- structure). These proteins form transmembrane pores or 
5 channels that usually allow the energy independent passage of solutes across a membrane. The 
transmembrane portions of these proteins consist exclusively of b-strands that form a b-barrel. 
These porin-type proteins are found in the outer membranes of Gram-negative bacteria, 
mitochondria and eukaryotic plastids. 

Methyltransferase-driven active transporters. A single characterized protein currently 
1 0 falls into this category, the Na+-transporting methyltetrahydromethanopterinxoenzyme M 
methyhransferase. 

Non-ribosome-synthesized channel-forming peptides or peptide-like molecules. These 
molecules, usually chains of L- and D-amino acids as well as other small molecular building 
blocks such as lactate, form oligomeric transmembrane ion channels. Voltage may induce 
1 5 channel formation by promoting assembly of the transmembrane channel. These peptides are 
often made by bacteria and fungi as agents of biological warfare. 

Non-Proteinaceous Transport Complexes. Ion conducting substances in biological 
membranes that do not consist of or are not derived from proteins or peptides fall into this 
category. 

20 Functionally characterized transporters for which sequence data are lacking. Transporters 

of particular physiological significance will be included in this category even though a family 

assignment cannot be made. 

Putative transporters in which no family member is an established transporter. Putative 

transport protein families are grouped under this number and will either be classified elsewhere 
25 when the transport function of a member becomes established, or will be eliminated from the TC 

classification system if the proposed transport function is disproven. These families include a 

member or members for which a transport function has been suggested, but evidence for such a 

function is not yet compelling. 

Auxiliary transport proteins. Proteins that in some way facilitate transport across one or 
30 more biological membranes but do not themselves participate directly in transport are included in 

this class. These proteins always function in conjunction with one or more transport proteins. 

They may provide a function connected with energy coupling to transport, play a structural role 

in complex formation or serve a regulatory function. 
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Transporters of. unknown classification. Transport protein families of unknown 
classification are grouped under this number and will be classified elsewhere when the transport 
process and energy coupling mechanism are characterized. These families include at least one 
member for which a transport function has been established, but either the mode of transport or 
5 the energy coupling mechanism is not known. 



. Ion channels 

An important type of transporter is the ion channel. Ion channels regulate many different 
cell proliferation, differentiation, and signaling processes by regulating the flow of ions into and 

1 0 out of cells. Ion channels are found in the plasma membranes of virtually every cell in 

eukaryotic organisms. Ion channels mediate a variety of cellular functions including regulation 
of membrane potentials and absorption and secretion of ion across epithelial membranes. When 
present in intracellular membranes of the Golgi apparatus and endocytic vesicles, ion channels, 
such as chloride channels, also regulate organelle pH. For a review, see Greger, R. (1988) Annu. 

1 5 Rev. Physiol. 50: 1 1 1 -122. 

Ion channels are generally classified by structure and the type of mode of action. For 
example, extracellular ligand gated channels (ELGs) are comprised of five polypeptide subunits, 
with each subunit having 4 membrane spanning domains, and are activated by the binding of an 
extracellular ligand to the channel. In addition, channels are sometimes classified by the ion type 

20 that is transported, for example, chlorine channels, potassium channels, etc. There may be many 
classes of channels for transporting a single type of ion (a detailed review of channel types can 
be found at Alexander, S.P.H. and JA. Peters (1997). Receptor and ion channel nomenclature 
supplement. Trends Pharmacol. Sci., Elsevier, pp. 65-68 and http://www- 
biology.ucsd.edu/~msaier/transport/toc.html. 

25 There are many types of ion channels based on structure. For example, many ion 

channels fall within one of the following groups: extracellular ligand-gated channels (ELG), 
intracellular ligand-gated channels (ILG), inward rectifying channels (INR), intercellular (gap 
junction) channels, and voltage gated channels (VIC). There are additionally recognized other 
channel families based on ion-type transported, cellular location and drug sensitivity. Detailed 

30 information on each of these, their activity, ligand type, ion type, disease association, drugability, 
and other information pertinent to the present invention, is well known in the art. 

Extracellular ligand-gated channels, ELGs, are generally comprised of five polypeptide 
subunits, Unwin, N. (1993), Cell 72: 31-41; Unwin, N. (1995), Nature 373: 37-43; Hucho, F., et 
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al., (1996) J. Neurochem. 66: 1781-1792; Hucho, F., et al., (1996) Eur. J. Biochem. 239: 539- 
557; Alexander, S.P.H. and LA. Peters (1997), Trends Pharmacol. Sci., Elsevier, pp. 4-6; 36-40; 
42-44; and Xue, H. (1998) J. Mol. Evol. 47: 323-333. Each subunit has 4 membrane spanning ' 
regions: this serves as a means of identifying other members of the ELG family of proteins. 
5 ELG bind a ligand and in response modulate the flow of ions. Examples of ELG include most 
members of the neurotransmitter-receptor family of proteins, e.g., GABAI receptors. Other 
members of this family of ion channels include glycine receptors, ryandyne receptors, and ligand 
gated calcium channels. 

10 Sodium/Calcium Exchangers 

The protein provided by the present invention is a novel sodium/calcium exchanger. 
Sodium/calcium exchangers (NCX) rapidly import calcium during excitation impulse. 
Intracellular calcium concentrations vary greatly during the excitation/relaxation cycle. In 
contrast, extracellular calcium concentrations are maintained at relatively steady levels, despite 
1 5 wide variations in the amounts of calcium supplied with food. 

There are at least three known mammalian NCX genes and a number of alternatively 
spliced isoforms. NCX sequences are highly conserved. NCX proteins contain 9 transmembrane 
domains and are regulated by calcium and sodium ions and, to some extent, by phosphorylation. 
NCX proteins initiate cardiac myocyte contractions; this effect has been confirmed by in 
20 vitro experiments. Together with calsequestrin, a calcium binding protein, NCX proteins 

maintain calcium homeostasis in the heart muscle. This regulatory mechanism depends on the 
gene dosage, as evident from experiments with transgenic animals. Variations in expression 
levels of these proteins may be associated with some forms of heart disease. 

Calcium transporters can mediate divalent ion toxicity. Barium and strontium can be 
25 carried by these channels into the cell, albeit at slower rates than calcium, which is the natural 
substrate. A panel of bivalent cations, such as copper, lead, cadmium, cobalt and nickel, inhibit 
calcium flow, but do not penetrate the cell membrane. Bivalent and trivalent iron, manganese, 
and zinc show no effect. 

The sequence of the sodium/calcium exchanger provided by the present invention may be 
30 used to screen human populations for mutations associated with neurological conditions and 
heart disease. Furthermore, drugs can be designed that target this and other transporters. 

For a further review of sodium/calcium exchangers, see: Linck et aL, J Pharmacol Exp 
Ther 2000 Aug;294(2):648-57; Shen et al, J Pharmacol Exp Ther 2000 Aug;294(2):562-70; 
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Philipson et al., Annu Rev Physiol 2000;62:1 1 1-33; Zhang et al, Br J Pharmacol 2000 
Jun;130(3):485-8; and Vercesi et ah, FEBS Lett 2000 May 12;473(2):203-6. 

The Voltage-gated Ion C hannel CW\C\ Superfamilv 

Proteins of the VIC family are ion-selective channel proteins found in a wide range of 
bacteria, archaea and eukaryotes Hille, B. (1992), Chapter 9: Structure of channel proteins; 
Chapter 20: Evolution and diversity. In: Ionic Channels of Excitable Membranes, 2nd Ed., 
Sinaur Assoc. Inc., Pubs., Sunderland, Massachusetts; Sigworth, F.J. (1993), Quart. Rev. 
Biophys. 27: 1-40; Salkoff, L. and T. Jegla (1995), Neuron 15: 489-492; Alexander, S.P.H. et al., 

(1997) , Trends Pharmacol. Sci., Elsevier, pp. 76-84; Jan, L.Y. et al., (1997), Annu. Rev. 
Neurosci. 20: 91-123; Doyle, D.A, et al., (1998) Science 280: 69-77; Terlau, H. and W. StuhmW 

(1998) , Naturwissenschaften 85: 437-444. They are often homo- or heterooligomeric structures 
with several dissimilar subunits (e.g., al-a2-d-b Ca 2+ channels, ab,b 2 Na + channels or (a) 4 -b K + 
channels), but the channel and the primary receptor is usually associated with the a (or al) 
subunit. Functionally characterized members are specific for K + , Na + or Ca 2+ . The K + channels 
usually consist of homotetrameric structures with each a-subunit possessing six transmembrane 
spanners (TMSs). The al and a subunits of the Ca 2+ and Na + channels, respectively, are about 
four times as large and possess 4 units, each with 6 TMSs separated by a hydrophilic loop, for a 
total of 24 TMSs. These large channel proteins form heterotetra-unit structures equivalent to the 
homotetrameric structures of most K + channels. All four units of the Ca 2+ and Na + channels are 
homologous to the single unit in the homotetrameric K + channels. Ion flux via the eukaryotic 
channels is generally controlled by the transmembrane electrical potential (hence the 
designation, voltage-sensitive) although some are controlled by ligand or receptor binding. 

Several putative K + -selective channel proteins of the VIC family have been identified in 
prokaryotes. The structure of one of them, the KcsA K + channel of Streptomyces lividans, has 
been solved to 3.2 A resolution. The protein possesses four identical subunits, each with two 
transmembrane helices, arranged in the shape of an inverted teepee or cone. The cone cradles the 
"selectivity filter" P domain in its outer end. The narrow selectivity filter is only 12 A long, 
whereas the remainder of the channel is wider and lined with hydrophobic residues. A large 
water-filled cavity and helix dipoles stabilize K + in the pore. The selectivity filter has two bound 
K + ions about 7.5 A apart from each other. Ion conduction is proposed to result from a balance of 
electrostatic attractive and repulsive forces. 
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In eukaryotes, each VIC family channel type has several subtypes based on 
pharmacological and electrophysiological data. Thus, there are five types of Ca 2+ channels (L, N, 
P, Q and T). There are at least ten types of K + channels, each responding in different ways to 
different stimuli: voltage-sensitive [Ka, Kv, Kvr, Kvs and Ksr], Ca 2+ -sensitive [BKc a , IKc a and 
5 SKca] and receptor-coupled [K M and K A ch]- There are at least six types of Na + channels (I, II, III, 
lil 9 HI and PN3). Tetrameric channels from both prokaryotic and eukaryotic organisms are 
known in which each a-subunit possesses 2 TMSs rather than 6, and these two TMSs are 
homologous to TMSs 5 and 6 of the six TMS unit found in the voltage-sensitive channel 
proteins. KcsA of S. lividans is an example of such a 2 TMS channel protein. These channels 

10 may include the Kn 3 (Na + -activated) and Kvoi (cell volume-sensitive) K + channels, as well as 

distantly related channels such as the Tokl K + channel of yeast, the TWIK-1 inward rectifier K + 
channel of the mouse and the TREK-1 K + channel of the mouse. Because of insufficient 
sequence similarity with proteins of the VIC family, inward rectifier K + IRK channels (ATP- 
regulated; G-protein-activated) which possess a P domain and two flanking TMSs are placed in a 

15 distinct family. However, substantial sequence similarity in the P region suggests that they are 
homologous. The b, g and d subunits of VIC family members, when present, frequently play 
regulatory roles in channel activation/deactivation. 

The Epithelial Na* Channel (ENaC) Family 

The ENaC family consists of over twenty-four sequenced proteins (Canessa, CM., et al., 
20 (1994), Nature 367: 463-467, Le, T. and M.H. Saier, Jr. (1996), Mol. Membr. Biol. 13: 149-157; 
Garty, H. and L.G. Palmer (1997), Physiol. Rev. 77: 359-396; Waldmann, R., et al., (1997), 
Nature 386: 173-177; Darboux, I., et al., (1998), J. Biol. Chem. 273: 9424-9429; Firsov, D., et 
al., (1998), EMBO J. 17: 344-352; Horisberger, J.-D. (1998). Curr. Opin. Struc. Biol. 10: 443- 
449). All are from animals with no recognizable homologues in other eukaryotes or bacteria. 
25 The vertebrate ENaC proteins from epithelial cells cluster tightly together on the phylogenetic 
tree: voltage-insensitive ENaC homologues are also found in the brain. Eleven sequenced C 
elegans proteins, including the degenerins, are distantly related to the vertebrate proteins as well 
as to each other. At least some of these proteins form part of a mechano-transducing complex for 
touch sensitivity. The homologous Helix aspersa (FMRF-amide)-activated Na + channel is the 
30 first peptide neurotransmitter-gated ionotropic receptor to be sequenced. 

Protein members of this family all exhibit the same apparent topology, each with N- and 
C-termini on the inside of the cell, two amphipathic transmembrane spanning segments, and a 
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large extracellular loop. The extracellular domains contain numerous highly conserved cysteine 
residues. They are proposed to serve a receptor function. 

Mammalian ENaC is important for the maintenance of Na + balance and the regulation of 
blood pressure. Three homologous ENaC subunits, alpha, beta, and gamma, have been shown to 
5 assemble to form the highly Na ""-selective channel. The stoichiometry of the three subunits is 
alpha 2 ,betal, gammal in a heterotetrameric architecture. 

The Glutam ate-gated Ion Channel (GIC) Family of Neurotransmitter Receptors 

Members of the GIC family are heteropentameric complexes in which each of the 5 
subunits is of 800-1000 amino acyl residues in length (Nakanishi, N., et al, (1990), Neuron 5: 

10 569-581 ; Unwin, N. (1993), Cell 72: 31-41; Alexander, S.P.H. and J.A. Peters (1997) Trends 

Pharmacol. Sci., Elsevier, pp. 36-40). These subunits may span the membrane three or five times 
as putative a-helices with the N-termini (the glutamate-binding domains) localized 
extracellularly and the C-termini localized cytoplasmically. They may be distantly related to the 
ligand-gated ion channels, and if so, they may possess substantial b-structure in their 

1 5 transmembrane regions. However, homology between these two families cannot be established 
on the basis of sequence comparisons alone. The subunits fall into six subfamilies: a, b, g, d, e 
and z. 

The GIC channels are divided into three types: (1) a-amino-3-hydroxy-5-methyl-4- 
isoxazole propionate (AMPA)-, (2) kainate- and (3) N-methyl-D-aspartate (NMDA)-selective 

20 glutamate receptors. Subunits of the AMPA and kainate classes exhibit 35-40% identity with 
each other while subunits of the NMDA receptors exhibit 22-24% identity with the former 
subunits. They possess large N-terminal, extracellular glutamate-binding domains that are 
homologous to the periplasmic glutamine and glutamate receptors of ABC-type uptake 
permeases of Gram-negative bacteria. All known members of the GIC family are from animals. 

25 The different channel (receptor) types exhibit distinct ion selectivities and conductance 
properties. The NMDA-selective large conductance channels are highly permeable to 
monovalent cations and Ca 2+ . The AMPA- and kainate-selective ion channels are permeable 
primarily to monovalent cations with only low permeability to Ca 2+ . 

The Chloride Channel fCIC) Family 

30 The C1C family is a large family consisting of dozens of sequenced proteins derived from 

Gram-negative and Gram-positive bacteria, cyanobacteria, archaea, yeast, plants and animals 
(Steinmeyer, K., et al., (1991), Nature 354: 301-304; Uchida, S., et al., (1993), J. Biol. Chem. 
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268: 3821-3824; Huang, M.-E., et al., (1994), J. Mol. Biol. 242: 595-598; Kawasaki, M., et al, 

(1994) , Neuron 12: 597-604; Fisher, W.E., et al., (1995), Genomics. 29:598-606; and Foskett, 
J.K. (1998), Annu. Rev. Physiol. 60: 689-717). These proteins are essentially ubiquitous, 
although they are not encoded within genomes of Haemophilus influenzae^ Mycoplasma 

5 genital ium, and Mycoplasma pneumoniae. Sequenced proteins vary in size from 395 amino acyl 
residues (M. jannaschii) to 988 residues (man). Several organisms contain multiple C1C family 
paralogues. For example, Synechocystis has two paralogues, one of 451 residues in length and 
the other of 899 residues. Arabidopsis thaliana has at least four sequenced paralogues, (775-792 
residues), humans also have at least five paralogues (820-988 residues), and C. elegans also has 

10 at least five (810-950 residues). There are nine known members in mammals, and mutations in 
three of the corresponding genes cause human diseases. E. coli, Methanococcus jannaschii and 
Saccharomyces cerevisiae only have one C1C family member each. With the exception of the 
larger Synechocystis paralogue, all bacterial proteins are small (395-492 residues) while all 
eukaryotic proteins are larger (687-988 residues). These proteins exhibit 10-12 putative 

1 5 transmembrane a-helical spanners (TMSs) and appear to be present in the membrane as 

homodimers. While one member of the family, Torpedo CIC-O, has been reported to have two 
channels, one per subunit, others are believed to have just one. 

All functionally characterized members of the C1C family transport chloride, some in a 
voltage-regulated process. These channels serve a variety of physiological functions (cell volume 

20 regulation; membrane potential stabilization; signal transduction; transepithelial transport, etc.). 
Different homologues in humans exhibit differing anion selectivities, i.e., C1C4 and C1C5 share a 
NO3" > Cf > Br" > I" conductance sequence, while C1C3 has an V > CI" selectivity. The C1C4 and 
C1C5 channels and others exhibit outward rectifying currents with currents only at voltages more 
positive than +20mV. 

25 Animal Inward Rectifier K + Channel fIRK-O Family 

IRK channels possess the "minimal channel-forming structure" with only a P domain, 
characteristic of the channel proteins of the VIC family, and two flanking transmembrane 
spanners (Shuck, M.E., et al., (1994), J. Biol. Chem. 269: 24261-24270; Ashen, M.D., et al., 

(1995) , Am. J. Physiol. 268: H506-H51 1 ; Salkoff, L. and T. Jegla (1995), Neuron 15: 489-492; 
30 Aguilar-Bryan, L., et al., (1998), Physiol. Rev. 78: 227-245; Ruknudin, A., et al., (1998), J. Biol. 

Chem. 273: 14165-14171). They may exist in the membrane as homo- or heterooligomers. They 
have a greater tendency to let K + flow into the cell than out. Voltage-dependence may be 
regulated by external K + , by internal Mg 2+ , by internal ATP and/or by G-proteins. The P domains 
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of IRK channels exhibit limited sequence similarity to those of the VIC family, but this sequence 
similarity is insufficient to establish homology. Inward rectifiers play a role in setting cellular 
membrane potentials, and the closing of these channels upon depolarization permits the 
occurrence of long duration action potentials with a plateau phase. Inward rectifiers lack the 
intrinsic voltage sensing helices found in VIC family channels. In a few cases, those of Kir 1.1 a 
and Kir6.2, for example, direct interaction with a member of the ABC superfamily has been 
proposed to confer unique functional and regulatory properties to the heteromeric complex, 
including sensitivity to ATP. The SUR1 sulfonylurea receptor (spQ09428) is the ABC protein 
that regulates the Kir6.2 channel in response to ATP, and CFTR may regulate Kirl.la. Mutations 
in SUR1 are the cause of familial persistent hyperinsulinemic hypoglycemia in infancy (PHHI), 
an autosomal recessive disorder characterized by unregulated insulin secretion in the pancreas. 

ATP-gated Cation Channel fACC) Family 

Members of the ACC family (also called P2X receptors) respond to ATP, a functional 
neurotransmitter released by exocytosis from many types of neurons (North, R.A. (1996), Curr. 
Opin. Cell Biol. 8: 474-483; Soto, F., M. Garcia-Guzman and W. Stiihmer (1997), J. Membr. 
Biol. 160: 91-100). They have been placed into seven groups (P2Xi - P2X 7 ) based on their 
pharmacological properties. These channels, which function at neuron-neuron and neuron- 
smooth muscle junctions, may play roles in the control of blood pressure and pain sensation. 
They may also function in lymphocyte and platelet physiology. They are found only in animals. 

The proteins of the ACC family are quite similar in sequence (>35% identity), but they 
possess 380-1000 amino acyl residues per subunit with variability in length localized primarily 
to the C-terminal domains. They possess two transmembrane spanners, one about 30-50 residues 
from their N-termini, the other near residues 320-340. The extracellular receptor domains 
between these two spanners (of about 270 residues) are well conserved with numerous conserved 
glycyl and cysteyl residues. The hydrophilic C-termini vary in length from 25 to 240 residues. 
They resemble the topologically similar epithelial Na + channel (ENaC) proteins in possessing (a) 
N- and C-termini localized intracellular^, (b) two putative transmembrane spanners, (c) a large 
extracellular loop domain, and (d) many conserved extracellular cysteyl residues. ACC family 
members are, however, not demonstrably homologous with them. ACC channels are probably 
hetero- or homomultimers and transport small monovalent cations (Me 4 ). Some also transport 
Ca 2+ ; a few also transport small metabolites. 



10 



<WO 0233086A2_I_> 



WO 02/33086 



PCT/US01/32152 



The Ryanodine-Inositol 1 A5-triphosphate Receptor Ca Channel (RIR-CaC) Family 

Ryanodine (Ry)-sensitive and inositol 1 ,4,5-triphosphate (IP3)-sensitive Ca 2+ -release 
channels function in the release of Ca 2+ from intracellular storage sites in animal cells and 
thereby regulate various Ca 2+ -dependent physiological processes (Hasan, G. et al., (1992) 
5 Development 1 1 6: 967-975; Michikawa, T., et al. 3 (1994), J. Biol. Chem. 269: 91 84-91 89; 
Tunwell, RE. A., (1996), Biochem. J. 318: 477-487; Lee, A.G. (1996) Biomembranes, Vol. 6, 
Transmembrane Receptors and Channels (A.G. Lee, ed.), JAI Press, Denver, CO., pp 291-326; 
Mikoshiba, et al., (1996) J. Biochem. Biomem. 6: 273-289). Ry receptors occur primarily in 
muscle cell sarcoplasmic reticular (SR) membranes, and BP3 receptors occur primarily in brain 

10 cell endoplasmic reticular (ER) membranes where they effect release of Ca 2+ into the cytoplasm 
upon activation (opening ) of the channel. 

The Ry receptors are activated as a result of the activity of dihydropyridine-sensitive Ca 2+ 
channels. The latter are member* of the voltage-sensitive ion channel (VIC) family. 
Dihydropyridine-sensitive channels are present in the T-tubular systems of muscle tissues. 

15 Ry receptors are homotetrameric complexes with each subunit exhibiting a molecular 

size of over 500,000 daltons (about 5,000 amino acyl residues). They possess C-terminal 
domains with six putative transmembrane a -helical spanners (TMSs). Putative pore-forming 
sequences occur between the fifth and sixth TMSs as suggested for members of the VIC family. 
The large N-terminal hydrophilic domains and the small C-terminal hydrophilic domains are ■ 

20 localized to the cytoplasm. Low resolution 3-dimensional structural data are available. Mammals 
possess at least three iso forms that probably arose by gene duplication and divergence before 
divergence of the mammalian species. Homologues are present in humans and Caenorabditis 
elegans. 

IP3 receptors resemble Ry receptors in many respects. (1) They are homotetrameric 
25 complexes with each subunit exhibiting a molecular size of over 300,000 daltons (about 2,700 

amino acyl residues). (2) They possess C-terminal channel domains that are homologous to those 
of the Ry receptors. (3) The channel domains possess six putative TMSs and a putative channel 
lining region between TMSs 5 and 6. (4) Both the large N-terminal domains and the smaller C- 
terminal tails face the cytoplasm. (5) They possess covalently linked carbohydrate on 
30 extracytoplasmic loops of the channel domains. (6) They have three currently recognized 

isoforms (types 1,2, and 3) in mammals which are subject to differential regulation and have 
different tissue distributions. 
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IP 3 receptors possess three domains: N-terminal IPs-binding domains, central coupling or 
regulatory domains and C-terminal channel domains. Channels are activated by IP 3 binding, and 
like the Ry receptors, the activities of the IP 3 receptor channels are regulated by phosphorylation 
of the regulatory domains, catalyzed by various protein kinases. They predominate in the 
endoplasmic reticular membranes of various cell types in the brain but have also been found in 
the plasma membranes of some nerve cells derived from a variety of tissues. 

The channel domains of the Ry and IP 3 receptors comprise a coherent family that in spite 
of apparent structural similarities, do not show appreciable sequence similarity of the proteins of 
the VIC family. The Ry receptors and the IP 3 receptors cluster separately on the RIR-CaC family 
tree. They both have homologues in Drosophila. Based on the phylogenetic tree for the family, 
the family probably evolved in the following sequence: (1) A gene duplication event occurred^, 
that gave rise to Ry and IP 3 receptors in invertebrates. (2) Vertebrates evolved from 
invertebrates. (3) The three isoforms of each receptor arose as a result of two distinct gene 
duplication events. (4) These isoforms were transmitted to mammals before divergence of the 
mammalian species. 

The Oreane llar Chloride Channel f O-CIO Family 

Proteins of the O-CIC family are voltage-sensitive chloride channels found in 
intracellular membranes but not the plasma membranes of animal cells (Landry, D, et al., (1993), 
J. Biol. Chem. 268: 14948-14955; Valenzuela, Set al., (1997), J. Biol. Chem. 272: 12575-12582; 
and Duncan, R.R., et al., (1997), J. Biol. Chem. 272: 23880-23886). 

They are found in human nuclear membranes, and the bovine protein targets to the 
microsomes, but not the plasma membrane, when expressed in Xenopus laevis oocytes. These 
proteins are thought to function in the regulation of the membrane potential and in transepithelial 
ion absorption and secretion in the kidney. They possess two putative transmembrane a-helical 
spanners (TMSs) with cytoplasmic N- and C-termini and a large luminal loop that may be 
glycosylated. The bovine protein is 437 amino acyl residues in length and has the two putative 
TMSs at positions 223-239 and 367-385. The human nuclear protein is much smaller (241 
residues). A C. elegans homologue is 260 residues long. 

Transporter proteins, particularly members of the sodium/calcium exchanger subfamily, are 
a major target for drug action and development. Accordingly, it is valuable to the field of 
pharmaceutical development to identify and characterize previously unknown transport proteins. 
The present invention advances the state of the art by providing previously unidentified human 
transport proteins. 
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SUMMARY OF THE INVENTION 

The present invention is based in part on the identification of amino acid sequences of 
human transporter peptides and proteins that are related to the sodium/calcium exchanger 
5 subfamily, as well as allelic variants and other mammalian orthologs thereof. These unique 

peptide sequences, and nucleic acid sequences that encode these peptides, can be used as models 
for the development of human therapeutic targets, aid in the identification of therapeutic 
proteins, and serve as targets for the development of human therapeutic agents that modulate 
transporter activity in cells and tissues that express the transporter. Experimental data as 
10 provided in Figure 1 indicates expression in humans in brain, heart, kidney, lung, spleen, testis, 
leukocyte and fetal brain. * ' 

DESCRIPTION OF THE FIGURE SHEETS 

FIGURE 1 provides the nucleotide sequence of a cDNA molecule or transcript sequence 
1 5 that encodes the transporter protein of the present invention (SEQ ID NO: 1 ). In addition 

structure and functional information is provided, such as ATG start, stop and tissue distribution, 
where available, that allows one to readily determine specific uses of inventions based on this 
molecular sequence. Experimental data as provided in Figure 1 indicates expression in humans 
in brain, heart, kidney, lung, spleen, testis, leukocyte and fetal brain. 
20 FIGURE 2 provides the predicted amino acid sequence of the transporter of the present 

invention. (SEQ ID NO:2) In addition structure and functional information such as protein 
family, function, and modification sites is provided where available, allowing one to readily 
determine specific uses of inventions based on this molecular sequence. 

FIGURE 3 provides genomic sequences that span the gene encoding the transporter 
25 protein of the present invention (SEQ ID NO: 3). In addition structure and functional 

information, such as intron/exon structure, promoter location, etc., is provided where available, 
allowing one to readily determine specific uses of inventions based on this molecular sequence. 
140 SNPs, including 6 indels, have been identified in the gene encoding the transporter protein 
provided by the present invention and are given in Figure 3. 

30 
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DETAILED DESCRIPTION OF THE INVENTION 

General Description 

The present invention is based on the sequencing of the human genome. During the 
sequencing and assembly of the human genome, analysis of the sequence information revealed 
previously unidentified fragments of the human genome that encode peptides that share 
structural and/or sequence homology to protein/peptide/domains identified and characterized 
within the an as being a transporter protein or part of a transporter protein and are related to the 
sodium/calcium exchanger subfamily. Utilizing these sequences, additional genomic sequences 
were assembled and transcript and/or cDNA sequences were isolated and characterized. Based 
on this analysis, the present invention provides amino acid sequences of human transporter 
peptides and proteins that are related to the sodium/calcium exchanger subfamily, nucleic acid' 
sequences in the form of transcript sequences, cDNA sequences and/or genomic sequences that 
encode these transporter peptides and proteins, nucleic acid variation (allelic information), tissue 
distribution of expression, and information about the closest art known protein/peptide/domain 
that has structural or sequence homology to the transporter of the present invention. 

In addition to being previously unknown, the peptides that are provided in the present 
invention are selected based on their ability to be used for the development of commercially 
important products and services. Specifically, the present peptides are selected based on 
homology and/or structural relatedness to known transporter proteins of the sodium/calcium 
exchanger subfamily and the expression pattern observed . Experimental data as provided in 
Figure 1 indicates expression in humans in brain, heart, kidney, lung, spleen, testis, leukocyte 
and fetal brain.. The art has clearly established the commercial importance of members of this 
family of proteins and proteins that have expression patterns similar to that of the present gene. 
Some of the more specific features of the peptides of the present invention, and the uses thereof, 
are described herein, particularly in the Background of the Invention and in the annotation 
provided in the Figures, and/or are known within the art for each of the known sodium/calcium 
exchanger family or subfamily of transporter proteins. 
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Specific Embodiments 
Peptide Molecules 

The present invention provides nucleic acid sequences that encode protein molecules that 
have been identified as being members of the transporter family of proteins and are related to the 
5 sodium/calcium exchanger subfamily (protein sequences are provided in Figure 2, 

transcript/cDNA sequences are provided in Figures 1 and genomic sequences are provided in 
Figure 3). The peptide sequences provided in Figure 2, as well as the obvious variants described 
herein, particularly allelic variants as identified herein and using the information in Figure 3, will 
be referred herein as the transporter peptides of the present invention, transporter peptides, or 
1 0 peptides/proteins of the present invention. 

The present invention provides isolated peptide and protein molecules that consist of, 
consist essentially of, or comprising the amino acid sequences of the transporter peptides 
disclosed in the Figure 2, (encoded by the nucleic acid molecule shown in Figure 1, 
transcript/cDNA or Figure 3, genomic sequence), as well as all obvious variants of these 
1 5 peptides that are within the art to make and use. Some of these variants are described in detail 
below. 

As used herein, a peptide is said to be "isolated" or "purified" when it is substantially free 
of cellular material or free of chemical precursors or other chemicals. The peptides of the present 
invention can be purified to homogeneity or other degrees of purity. The level of purification will 

20 be based on the intended use. The critical feature is that the preparation allows for the desired 

function of the peptide, even if in the presence of considerable amounts of other components (the 
features of an isolated nucleic acid molecule is discussed below). 

In some uses, "substantially free of cellular material" includes preparations of the peptide 
having less than about 30% (by dry weight) other proteins (i.e., contaminating protein), less than 

25 about 20% other proteins, less than about 10% other proteins, or less than about 5% other proteins. 
When the peptide is recombinantly produced, it can also be substantially free of culture medium, 
i.e., culture medium represents less than about 20% of the volume of the protein preparation. 

The language "substantially free of chemical precursors or other chemicals" includes 
preparations of the peptide in which it is separated from chemical precursors or other chemicals that 

30 are involved in its synthesis. In one embodiment, the language "substantially free of chemical 
precursors or other chemicals" includes preparations of the transporter peptide having less than 
about 30% (by dry weight) chemical precursors or other chemicals, less than about 20% chemical 
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precursors or other chemicals, less than about 10% chemical precursors or other chemicals, or less 
than about 5% chemical precursors or other chemicals. 

The isolated transporter peptide can be purified from cells that naturally express it, purified 
from cells that have been altered to express it (recombinant), or synthesized using known protein 
synthesis methods. Experimental data as provided in Figure 1 indicates expression in humans in 
brain, heart, kidney, lung, spleen, testis, leukocyte and fetal brain. For example, a nucleic acid 
molecule encoding the transporter peptide is cloned into an expression vector, the expression vector 
introduced into a host cell and the protein expressed in the host cell. The protein can then be 
isolated from the cells by an appropriate purification scheme using standard protein purification 
techniques. Many of these techniques are described in detail below. 

Accordingly, the present invention provides proteins that consist of the amino acid 
sequences provided in Figure 2 (SEQ ID NO:2), for example, proteins encoded by the • ' . 

transcript/cDNA nucleic acid sequences shown in Figure 1 (SEQ ID NO: 1 ) and the genomic 
sequences provided in Figure 3 (SEQ ED NO:3). The amino acid sequence of such a protein is 
provided in Figure 2. A protein consists of an amino acid sequence when the amino acid sequence 
is the final amino acid sequence of the protein. 

The present invention further provides proteins that consist essentially of the amino acid 
sequences provided in Figure 2 (SEQ ID NO:2), for example, proteins encoded by the 
transcript/cDNA nucleic acid sequences shown in Figure I (SEQ ID NO:l) and the genomic 
sequences provided in Figure 3 (SEQ ID NO:3). A protein consists essentially of an amino acid 
sequence when such an amino acid sequence is present with only a few additional amino acid 
residues, for example from about 1 to about 100 or so additional residues, typically from 1 to about 
20 additional residues in the final protein. 

The present invention further provides proteins that comprise the amino acid sequences 
provided in Figure 2 (SEQ ID NO:2), for example, proteins encoded by the transcript/cDNA nucleic 
acid sequences shown in Figure 1 (SEQ ID NO:l) and the genomic sequences provided in Figure 3 
(SEQ ID NO:3). A protein comprises an amino acid sequence when the amino acid sequence is at 
least part of the final amino acid sequence of the protein. In such a fashion, the protein can be only 
the peptide or have additional amino acid molecules, such as amino acid residues (contiguous 
encoded sequence) that are naturally associated with it or heterologous amino acid residues/peptide 
sequences. Such a protein can have a few additional amino acid residues or can comprise several 
hundred or more additional amino acids. The preferred classes of proteins that are comprised of the 
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transporter peptides of the present invention are the naturally occurring mature proteins. A brief 
description of how various types of these proteins can be made/isolated is provided below. 

The transporter peptides of the present invention can be attached to heterologous sequences 
to form chimeric or fusion proteins. Such chimeric and fusion proteins comprise a transporter 
5 peptide operati vely linked to a heterologous protein having an amino acid sequence not 
substantially homologous to the transporter peptide. "Operatively linked" indicates that the 
transporter peptide and the heterologous protein are fused in-frame. The heterologous protein can 
be fused to the N-terminus or C-terminus of the transporter peptide. 

In some uses, the fusion protein does not affect the activity of the transporter peptide per se. 

1 0 For example, the fusion protein can include, but is not limited to, enzymatic fusion proteins, for 

example beta-galactosidase fusions, yeast two-hybrid GAL fusions, poly-His fusions, MYC-tagged, 
Hi-tagged and Ig fusions. Such fusion proteins, particularly poly-His fusions, can facilitate the 
purification of recombinant transporter peptide. In certain host cells (e.g., mammalian host cells), 
expression and/or secretion of a protein can be increased by using a heterologous signal sequence. 

15 A chimeric or fusion protein can be produced by standard recombinant DNA techniques. 

For example, DNA fragments coding for the different protein sequences are ligated together in- 
frame in accordance with conventional techniques. In another embodiment, the fusion gene can be 
synthesized by conventional techniques including automated DNA synthesizers. Alternatively, PCR 
amplification of gene fragments can be carried out using anchor primers which give rise to 

20 complementary overhangs between two consecutive gene fragments which can subsequently be 
annealed and re-amplified to generate a chimeric gene sequence (see Ausubel et at, Current * 
Protocols in Molecular Biology, 1992). Moreover, many expression vectors are commercially 
available that already encode a fusion moiety (e.g., a GST protein). A transporter peptide-encoding 
nucleic acid can be cloned into such an expression vector such that the fusion moiety is linked in- 

25 frame to the transporter peptide. 

As mentioned above, the present invention also provides and enables obvious variants of the 
amino acid sequence of the proteins of the present invention, such as naturally occurring mature 
forms of the peptide, allelic/sequence variants of the peptides, non-naturally occurring 
recombinantly derived variants of the peptides, and orthologs and paralogs of the peptides. Such 

30 variants can readily be generated using art-known techniques in the fields of recombinant nucleic 
acid technology and protein biochemistry. It is understood, however, that variants exclude any 
amino acid sequences disclosed prior to the invention. 
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Such variants can readily be identified/made using molecular techniques and the sequence 
information disclosed herein. Further, such variants can readily be distinguished from other 
peptides based on sequence and/or structural homology to the transporter peptides of the present 
invention. The degree of homology/identity present will be based primarily on whether the peptide 
5 is a functional variant or non-functional variant, the amount of divergence present in the paralog 
family and the evolutionary distance between the orthologs. 

To determine the percent identity of two amino acid sequences or two nucleic acid 
sequences, the sequences are aligned for optimal comparison purposes (e.g., gaps can be 
introduced in one or both of a first and a second amino acid or nucleic acid sequence for optimal 

10 alignment and non-homologous sequences can be disregarded for comparison purposes). In a 
preferred embodiment, at least 30%, 40%, 50%, 60%, 70%, 80%, or 90% or more of a reference 
sequence is aligned for comparison purposes. The amino acid residues or nucleotides at 
corresponding amino acid positions or nucleotide positions are then compared. When a position 
in the first sequence is occupied by the same amino acid residue or nucleotide as the 

1 5 corresponding position in the second sequence, then the molecules are identical at that position 
(as used herein amino acid or nucleic acid "identity 11 is equivalent to amino acid or nucleic acid 
"homology"). The percent identity between the two sequences is a function of the number of 
identical positions shared by the sequences, taking into account the number of gaps, and the 
length of each gap, which need to be introduced for optimal alignment of the two sequences. 

20 The comparison of sequences and determination of percent identity and similarity 

between two sequences can be accomplished using a mathematical algorithm. {Computational 
Molecular Biology, Lesk, A.M., ed., Oxford University Press, New York, 1988; Biocomputing: 
Informatics and Genome Projects, Smith, D.W., ed., Academic Press, New York, 1993; Computer 
Analysis of Sequence Data, Part 1, Griffin, A.M., and Griffin, H.G., eds., Humana Press, New 

25 Jersey, 1 994; Sequence Analysis in Molecular Biology, von Heinje, G., Academic Press, 1 987; and 
Sequence Analysis Primer, Gribskov, M. and Devereux, J., eds., M Stockton Press, New York, 
1991). In a preferred embodiment, the percent identity between two amino acid sequences is 
determined using the Needleman and Wunsch (J. Mol Biol (48):444-453 (1970)) algorithm 
which has been incorporated into the GAP program in the GCG software package (available at 

30 http://www.gcg.com), using either a Blossom 62 matrix or a PAM250 matrix, and a gap weight 
of 16, 14, 12, 10, 8, 6, or 4 and a length weight of 1, 2, 3, 4, 5, or 6. In yet another preferred - 
embodiment, the percent identity between two nucleotide sequences is determined using the 
GAP program in the GCG software package (Devereux, J., et ai, Nucleic Acids Res. J2(1):3S7 
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(1984)) (available at http://www.gcg.com), using a NWSgapdna.CMP matrix and a gap weight of" 
40, 50, 60, 70, or 80 and a length weight of 1, 2, 3, 4, 5, or 6. In another embodiment, the 
percent identity between two amino acid or nucleotide sequences is determined using the 
algorithm of E. Myers and W. Miller (CABIOS, 4:1 1-17 (1989)) which has been incorporated 
5 into the ALIGN program (version 2.0), using a PAM120 weight residue table, a gap length 
penalty of 12 and a gap penalty of 4. 

The nucleic acid and protein sequences of the present invention can further be used as a 
"query sequence" to perform a search against sequence databases to, for example, identify other 
family members or related sequences. Such searches can be performed using the NBLAST and 

10 XBLAST programs (version 2.0) of Altschul, et al (J. Mol Biol 215:403-10 (1990)). BLAST 
nucleotide searches can be performed with the NBLAST program, score = 100, wordlength = 12 
to obtain nucleotide sequences homologous to the nucleic acid molecules of the invention. 
BLAST protein searches can be performed with the XBLAST program, score = 50, wordlength = 
3 to obtain amino acid sequences homologous to the proteins of the invention. To obtain gapped 

1 5 alignments for comparison purposes, Gapped BLAST can be utilized as described in Altschul et 
al (Nucleic Acids Res. 25(17):3389-3402 (1997)). When utilizing BLAST and gapped BLAST 
programs, the default parameters of the respective programs (e.g., XBLAST and NBLAST) can- 
be used. 

Full-length pre-processed forms, as well as mature processed forms, of proteins that 
20 comprise one of the peptides of the present invention can readily be identified as having complete 
sequence identity to one of the transporter peptides of the present invention as well as being 
encoded by the same genetic locus as the transporter peptide provided herein. As indicated by the 
data presented in Figure 3, the map position was determined to be on chromosome 14 by ePCR. 

Allelic variants of a transporter peptide can readily be identified as being a human protein 
25 having a high degree (significant) of sequence homology/identity to at least a portion of the 

transporter peptide as well as being encoded by the same genetic locus as the transporter peptide 
provided herein. Genetic locus can readily be determined based on the genomic information 
provided in Figure 3, such as the genomic sequence mapped to the reference human. As indicated 
by the data presented in Figure 3, the map position was determined to be on chromosome 14 by 
30 ePCR- As used herein, two proteins (or a region of the proteins) have significant homology 

when the amino acid sequences are typically at least about 70-80%, 80-90%, and more typically 
at least about 90-95% or more homologous. A significantly homologous amino acid sequence, 
according to the present invention, will be encoded by a nucleic acid sequence that will hybridize 
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to a transporter peptide encoding nucleic acid molecule under stringent conditions as more fully ' 
described below. 

Figure 3 provides information on SNPs that have been identified in a gene encoding the 
transporter protein of the present invention. 140 SNP variants were found, including 6 indels 
(indicated by a "-") and 1 SNPs in exons. The others were found in in introns and regions 5' and 
3' of the ORF. Such SNPs in introns and outside the ORF may affect control/regulatory 
elements. 

Paralogs of a transporter peptide can readily be identified as having some degree of 
significant sequence homology/identity to at least a portion of the transporter peptide, as being 
encoded by a gene from humans, and as having similar activity or function. Two proteins will 
typically be considered paralogs when the amino acid sequences are typically at least about 60% 
or greater, and more typically at least about 70% or greater homology through a given region or. 
domain. Such paralogs will be encoded by a nucleic acid sequence that will hybridize to a 
transporter peptide encoding nucleic acid molecule under moderate to stringent conditions as 
more fully described below. 

Orthologs of a transporter peptide can readily be identified as having some degree of 
significant sequence homology/identity to at least a portion of the transporter peptide as well as 
being encoded by a gene from another organism. Preferred orthologs will be isolated from 
mammals, preferably primates, for the development of human therapeutic targets and agents. Such 
orthologs will be encoded by a nucleic acid sequence that will hybridize to a transporter peptide 
encoding nucleic acid molecule under moderate to stringent conditions, as more fully described 
below, depending on the degree of relatedness of the two organisms yielding the proteins. 

Non-naturally occurring variants of the transporter peptides of the present invention can 
readily be generated using recombinant techniques. Such variants include, but are not limited to 
deletions, additions and substitutions in the amino acid sequence of the transporter peptide. For 
example, one class of substitutions are conserved amino acid substitution. Such substitutions are 
those that substitute a given amino acid in a transporter peptide by another amino acid of like 
characteristics. Typically seen as conservative substitutions are the replacements, one for another, 
among the aliphatic amino acids Ala, Val, Leu, and lie; interchange of the hydroxyl residues Ser 
and Thr; exchange of the acidic residues Asp and Glu; substitution between the amide residues Asn 
and Gin; exchange of the basic residues Lys and Arg; and replacements among the aromatic 
residues Phe and Tyr. Guidance concerning which amino acid changes are likely to be 
phenotypically silent are found in Bowie et ai, Science 247:1306-1310 (1990). 
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Variant transporter peptides can be fully functional or can lack function in one or more 
activities, e.g. ability to bind ligand, ability to transport ligand, ability to mediate signaling, etc. 
Fully functional variants typically contain only conservative variation or variation in non-critical 
residues or in non-critical regions. Figure 2 provides the result of protein analysis and can be used 
5 to identify critical domains/regions. Functional variants can also contain substitution of similar 
amino acids that result in no change or an insignificant change in function. Alternatively, such 
substitutions may positively or negatively affect function to some degree. 

Non-functional variants typically contain one or more non-conservative amino acid 
substitutions, deletions, insertions, inversions, or truncation or a substitution, insertion, inversion, or 
1 0 deletion in a critical residue or critical region. 

Amino acids that are essential for function can be identified by methods known in the art, 
such as site-directed mutagenesis or alanine-scanning mutagenesis (Cunningham et aL 9 Science 
244:1081-1085 (1989)), particularly using the results provided in Figure 2. The latter procedure 
introduces single alanine mutations at every residue in the molecule. The resulting mutant 
1 5 molecules are then tested for biological activity such as transporter activity or in assays such as an 
in vitro proliferative activity. Sites that are critical for binding partner/substrate binding can also be 
determined by structural analysis such as crystallization, nuclear magnetic resonance or 
photoaffinity labeling (Smith et al, 1 Mol Biol 224:899-904 (1992); de Vos et al Science 
255:306-312(1992)). 

20 The present invention further provides fragments of the transporter peptides, in addition to 

proteins and peptides that comprise and consist of such fragments, particularly those comprising the 
residues identified in Figure 2. The fragments to which the invention pertains, however, are not to 
be construed as encompassing fragments that may be disclosed publicly prior to the present 
invention. 

25 As used herein, a fragment comprises at least 8, 10, 12, 14, 16, or more contiguous amino 

acid residues from a transporter peptide. Such fragments can be chosen based on the ability to 
retain one or more of the biological activities of the transporter peptide or could be chosen for the 
ability to perform a function, e.g. bind a substrate or act as an immunogen. Particularly important 
fragments are biologically active fragments, peptides that are, for example, about 8 or more amino 

30 acids in length. Such fragments will typically comprise a domain or motif of the transporter peptide, 
e.g., active site, a transmembrane domain or a substrate-binding domain. Further, possible 
fragments include, but are not limited to, domain or motif containing fragments, soluble peptide 
fragments, and fragments containing immunogenic structures. Predicted domains and functional 
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sites are readily identifiable by computer programs well known and readily available to those of 
skill in the art (e.g., PROSITE analysis). The results of one such analysis are provided in Figure 2. 

Polypeptides often contain amino acids other than the 20 amino acids commonly referred to 
as the 20 naturally occurring amino acids. Further, many amino acids, including the terminal amino 
acids, may be modified by natural processes, such as processing and other post-translational 
modifications, or by chemical modification techniques well known in the art. Common ' 
modifications that occur naturally in transporter peptides are described in basic texts, detailed 
monographs, and the research literature, and they are well known to those of skill in the art (some of 
these features are identified in Figure 2). 

Known modifications include, but are not limited to, acetylation, acylation, ADP- 
ribosylation, amidation, covalent attachment of flavin, covalent attachment of a heme moiety, 
covalent attachment of a nucleotide or nucleotide derivative, covalent attachment of a lipid or lipid 
derivative, covalent attachment of phosphotidylinositol, cross-linking, cyclization, disulfide bond 
formation, demethylation, formation of covalent crosslinks, formation of cystine, formation of 
pyroglutamate, formylation, gamma carboxylation, glycosylate, GPI anchor formation, 
hydroxylation, iodination, methylation, myristoylation, oxidation, proteolytic processing,' 
phosphorylation, prenylation, racemization, selenoylation, sulfation, transfer-RNA mediated 
addition of amino acids to proteins such as arginylation, and ubiquitination. 

Such modifications are well known to those of skill in the art and have been described in 
great detail in the scientific literature. Several particularly common modifications, glycosylation, 
lipid attachment, sulfation, gamma-carboxylation of glutamic acid residues, hydroxylation and 
ADP-ribosylation, for instance, are described in most basic texts, such as Proteins - Structure and 
Molecular Properties, 2nd Ed., T.E. Creighton, W. H. Freeman and Company, New York (1993). 
Many detailed reviews are available on this subject, such as by Wold, F., Posttranslational Covalent 
Modification of Proteins, B.C. Johnson, Ed., Academic Press, New York 1-12 (1983); Seifter et al. 
(Meth. Enzymol. 182: 626-646 (1990)) and Rattan et al. (Ann. N. Y. Acad. Sci. 663A8-62 (1992)). 

Accordingly, the transporter peptides of the present invention also encompass derivatives or 
analogs in which a substituted amino acid residue is not one encoded by the genetic code, in which 
a substituent group is included, in which the mature transporter peptide is fused with another 
compound, such as a compound to increase the half-life of the transporter peptide (for example, 
polyethylene glycol), or in which the additional amino acids are fused to the mature transporter' 
peptide, such as a leader or secretory sequence or a sequence for purification of the mature 
transporter peptide or a pro-protein sequence. 
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Protein/Peptide Uses 

The proteins of the present invention can be used in substantial and specific assays 
related to the functional information provided in the Figures; to raise antibodies or to elicit 
5 another immune response; as a reagent (including the labeled reagent) in assays designed to 
quantitatively determine levels of the protein (or its binding partner or ligand) in biological 
fluids; and as markers for tissues in which the corresponding protein is preferentially expressed 
(either constitutively or at a particular stage of tissue differentiation or development or in a 
disease state). Where the protein binds or potentially binds to another protein or ligand (such as, 

10 for example, in a transporter-effector protein interaction or transporter-Iigand interaction), the 
protein can be used to identify the binding partner/ligand so as to develop a system to identify v 
inhibitors of the binding interaction. Any or all of these uses are capable of being developed into 
reagent grade or kit format for commercialization as commercial products. 

Methods for performing the uses listed above are well known to those skilled in the art. 

15 References disclosing such methods include "Molecular Cloning: A Laboratory Manual", 2d ed., 
Cold Spring Harbor Laboratory Press, Sambrook, J., E. F. Fritsch and T. Maniatis eds., 1989, 
and "Methods in Enzymology: Guide to Molecular Cloning Techniques", Academic Press, 
Berger, S. L. and A. R. Kimmel eds., 1987. 

The potential uses of the peptides of the present invention are based primarily on the 

20 source of the protein as well as the class/action of the protein. For example, transporters isolated 
from humans and their human/mammalian orthologs serve as targets for identifying agents for 
use in mammalian therapeutic applications, e.g. a human drug, particularly in modulating a 
biological or pathological response in a cell or tissue that expresses the transporter. 
Experimental data as provided in Figure 1 indicates that sodium/calcium exchanger proteins of 

25 the present invention are expressed in humans in the heart, retina, kidney, fetal brain, and fetal 
heart. Specifically, a virtual northern blot shows expression in the fetal brain. In addition, PCR- 
based tissue screening panel indicates expression in brain, heart, kidney, lung, spleen, testis, 
leukocyte and fetal brain. A large percentage of pharmaceutical agents are being developed that 
modulate the activity of transporter proteins, particularly members of the sodium/calcium 

30 exchanger subfamily (see Background of the Invention). The structural and functional 

information provided in the Background and Figures provide specific and substantial uses for the 
molecules of the present invention, particularly in combination with the expression information 
provided in Figure 1 . Experimental data as provided in Figure 1 indicates expression in humans in 
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brain, heart, kidney, lung, spleen, testis, leukocyte and fetal brain. Such uses can readily be 
determined using the information provided herein, that known in the art and routine 
experimentation. 

The proteins of the present invention (including variants and fragments that may have been 
5 disclosed prior to the present invention) are useful for biological assays related to transporters that 
are related to members of the sodium/calcium exchanger subfamily. Such assays involve any of the 
known transporter functions or activities or properties useful for diagnosis and treatment of 
transporter-related conditions that are specific for the subfamily of transporters that the one of the 
present invention belongs to, particularly in cells and tissues that express the transporter. 
1 0 Experimental data as provided in Figure 1 indicates that sodium/calcium exchanger proteins of the 
present invention are expressed in humans in the heart, retina, kidney, fetal brain, and fetal heart. 
Specifically, a virtual northern blot shows expression in the fetal brain. In addition, PCR-based 
tissue screening panel indicates expression in brain, heart, kidney, lung, spleen, testis, leukocyte and 
fetal brain. 

1 5 The proteins of the present invention are also useful in drug screening assays, in cell-based 

or cell-free systems ((Hodgson, Bio/technology, 1992, Sept 10(9);973-80). Cell-based systems can 
be native, i.e., cells that normally express the transporter, as a biopsy or expanded in cell culture. 
Experimental data as provided in Figure 1 indicates expression in humans in brain, heart, kidney, 
lung, spleen, testis, leukocyte and fetal brain. In an alternate embodiment, cell-based assays involve 

20 recombinant host cells expressing the transporter protein. 

The polypeptides can be used to identify compounds that modulate transporter activity of 
the protein in its natural state or an altered form that causes a specific disease or pathology 
associated with the transporter. Both the transporters of the present invention and appropriate 
variants and fragments can be used in high-throughput screens to assay candidate compounds for 

25 the ability to bind to the transporter. These compounds can be further screened against a functional 
transporter to determine the effect of the compound on the transporter activity. Further, these 
compounds can be tested in animal or invertebrate systems to determine activity/effectiveness. 
Compounds can be identified that activate (agonist) or inactivate (antagonist) the transporter to a 
desired degree. 

30 Further, the proteins of the present invention can be used to screen a compound for the 

ability to stimulate or inhibit interaction between the transporter protein and a molecule that 
normally interacts with the transporter protein, e.g. a substrate or a component of the signal pathway 
that the transporter protein normally interacts (for example, another transporter). Such assays 
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typically include the steps of combining the transporter protein with a candidate compound under 
conditions that allow the transporter protein, or fragment, to interact with the target molecule, and to 
detect the formation of a complex between the protein and the target or to detect the biochemical 
consequence of the interaction with the transporter protein and the target, such as any of the 
5 associated effects of signal transduction such as changes in membrane potential, protein 
phosphorylation, cAMP turnover, and adenylate cyclase activation, etc. 

Candidate compounds include, for example, 1) peptides such as soluble peptides, including 
Ig-tailed fusion peptides and members of random peptide libraries (see, e.g., Lam et al , Nature 
554:82-84 (1991); Houghten et al, Nature 554:84-86 (1991)) and combinatorial chemistry-derived 

1 0 molecular libraries made of D- and/or L- configuration amino acids; 2) phosphopeptides (e.g., 

members of random and partially degenerate, directed phosphopeptide libraries, see, e.g., Songyang 
et al 9 Cell 72:767-77$ (1993)); 3) antibodies (e.g., polyclonal, monoclonal, humanized, anti- 
idiotype, chimeric, and single chain antibodies as well as Fab, F(ab')2, Fab expression library 
fragments, and epitope-binding fragments of antibodies); and 4) small organic and inorganic 

1 5 molecules (e.g., molecules obtained from combinatorial and natural product libraries). 

One candidate compound is a soluble fragment of the receptor that competes for ligand 
binding. Other candidate compounds include mutant transporters or appropriate fragments 
containing mutations that affect transporter function and thus compete for ligand. Accordingly, a 
fragment that competes for ligand, for example with a higher affinity, or a fragment that binds 

20 ligand but does not allow release, is encompassed by the invention. 

The invention further includes other end point assays to identify compounds that modulate 
(stimulate or inhibit) transporter activity. The assays typically involve an assay of events in the 
signal transduction pathway that indicate transporter activity. Thus, the transport of a ligand, 
change in cell membrane potential, activation of a protein, a change in the expression of genes that 

25 are up- or down-regulated in response to the transporter protein dependent signal cascade can be 
assayed. 

Any of the biological or biochemical functions mediated by the transporter can be used as an 
endpoint assay. These include all of the biochemical or biochemical/biological events described 
herein, in the references cited herein, incorporated by reference for these endpoint assay targets, and 
30 other functions known to those of ordinary skill in the art or that can be readily identified using the 
information provided in the Figures, particularly Figure 2. Specifically, a biological function of a 
cell or tissues that expresses the transporter can be assayed. Experimental data as provided in 
Figure 1 indicates that sodium/calcium exchanger proteins of the present invention are expressed in 
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humans in the heart, retina, kidney, fetal brain, and fetal heart. Specifically, a virtual northern blot 
shows expression in the fetal brain. In addition, PCR-based tissue screening panel indicates 
expression in brain, heart, kidney, lung, spleen, testis, leukocyte and fetal brain. 

Binding and/or activating compounds can also be screened by using chimeric transporter 
proteins in which the amino terminal extracellular domain, or parts thereof, the entire 
transmembrane domain or subregions, such as any of the seven transmembrane segments or any of 
the intracellular or extracellular loops and the carboxy terminal intracellular domain, or parts 
thereof, can be replaced by heterologous domains or subregions. For example, a ligand-binding 
region can be used that interacts with a different ligand then that which is recognized by the native 
transporter. Accordingly, a different set of signal transduction components is available as an end- 
point assay for activation. Ibis allows for assays to be performed in other than the specific host cell 
from which the transporter is derived. 

The proteins of the present invention are also useful in competition binding assays in 
methods designed to disco* or compounds that interact with the transporter (e.g. binding partners 
and/or ligands). Thus, a compound is exposed to a transporter polypeptide under conditions that 
allow the compound to bind or to otherwise interact with the polypeptide. Soluble transporter 
polypeptide is also added to the mixture. If the test compound interacts with the soluble transporter 
polypeptide, it decreases the amount of complex formed or activity from the transporter target. This 
type of assay is particularly useful in cases in which compounds are sought that interact with 
specific regions of the transporter. Thus, the soluble polypeptide that competes with the target 
transporter region is designed to contain peptide sequences corresponding to the region of interest. 

To perform cell free drug screening assays, it is sometimes desirable to immobilize either 
the transporter protein, or fragment, or its target molecule to facilitate separation of complexes from 
uncomplexed forms of one or both of the proteins, as well as to accommodate automation of the 
assay. 

Techniques for immobilizing proteins on matrices can be used in the drug screening assays. 
In one embodiment, a fusion protein can be provided which adds a domain that allows the protein to 
be bound to a matrix. For example, glutathione-S-transferase fusion proteins can be adsorbed onto 
glutathione sepharose beads (Sigma Chemical, St. Louis, MO) or glutathione derivatized microtitre 
plates, which are then combined with the cell lysates (e.g., 35 S-labeled) and the candidate 
compound, and the mixture incubated under conditions conducive to complex formation (e.g., at 
physiological conditions for salt and pH). Following incubation, the beads are washed to remove 
any unbound label, and the matrix immobilized and radiolabel determined directly, or in the 
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supernatant after the complexes are dissociated. Alternatively, the complexes can be dissociated 
from the matrix, separated by SDS-PAGE, and the level of transporter-binding protein found in the 
bead fraction quantitated from the gel using standard electrophoretic techniques. For example, 
either the polypeptide or its target molecule can be immobilized utilizing conjugation of biotin and 
5 streptavidin using techniques well known in the art. Alternatively, antibodies reactive with the 
protein but which do not interfere with binding of the protein to its target molecule can be 
dcri vatized to the wells of the plate, and the protein trapped in the wells by antibody conjugation. 
Preparations of a transporter-binding protein and a candidate compound are incubated in the 
transporter protein-presenting wells and the amount of complex trapped in the well can be 

1 0 quantitated. Methods for detecting such complexes, in addition to those described above for the 
GST-immobilized complexes, include immunodetection of complexes using antibodies reactive 
with the transporter protein target molecule, or which are reactive with transporter protein and 
compete with the target molecule, as well as enzyme-linked assays which rely on detecting an 
enzymatic activity associated with the target molecule. 

1 5 Agents that modulate one of the transporters of the present invention can be identified using 

one or more of the above assays, alone or in combination. It is generally preferable to use a cell- 
based or cell free system first and then confirm activity in an animal or other model system. Such 
model systems are well known in the art and can readily be employed in this context. 

Modulators of transporter protein activity identified according to these drug screening 

20 assays can be used to treat a subject with a disorder mediated by the transporter pathway, by treating 
cells or tissues that express the transporter. Experimental data as provided in Figure 1 indicates 
expression in humans in brain, heart, kidney, lung, spleen, testis, leukocyte and fetal brain. These 
methods of treatment include the steps of administering a modulator of transporter activity in a 
pharmaceutical composition to a subject in need of such treatment, the modulator being identified as 

25 described herein. 

In yet another aspect of the invention, the transporter proteins can be used as "bait 
proteins 1 ' in a two-hybrid assay or three-hybrid assay (see, e.g., U.S. Patent No. 5,283,317; 
Zervos et al (1993) Cell 72:223-232; Madura etal (1993) J. Biol Chem. 268:12046-12054; 
Bartel etal (1993) Biotechniques 14:920-924; Iwabuchi etal (1993) Oncogene 8:1693-1696; 

30 and Brent WO94/10300), to identify other proteins, which bind to or interact with the transporter 
and are involved in transporter activity. Such transporter-binding proteins are also likely to be 
involved in the propagation of signals by the transporter proteins or transporter targets as, for 
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example, downstream elements of a transporter-mediated signaling pathway. Alternatively, such 
transporter-binding proteins are likely to be transporter inhibitors. 

The two-hybrid system is based on the modular nature of most transcription factors, 
which consist of separable DNA-binding and activation domains. Briefly, the assay utilizes two 
5 different DNA constructs. In one construct, the gene that codes for a transporter protein is fused 
to a gene encoding the DNA binding domain of a known transcription factor (e.g., GAL-4). In 
the other construct, a DNA sequence, from a library of DNA sequences, that encodes an 
unidentified protein ("prey" or "sample") is fused to a gene that codes for the activation domain 
of the known transcription factor. If the "bait" and the "prey" proteins are able to interact, in 

1 0 vivo, forming a transporter-dependent complex, the DNA-binding and activation domains of the 
transcription factor are brought into close proximity. This proximity allows transcription of a 
reporter gene (e.g., LacZ) which is operably linked to a transcriptional regulatory site responsive 
to the transcription factor. Expression of the reporter gene can be detected and cell colonies 
containing the functional transcription factor can be isolated and used to obtain the cloned gene 

1 5 which encodes the protein which interacts with the transporter protein. 

This invention further pertains to novel agents identified by the above-described 
screening assays. Accordingly, it is within the scope of this invention to further use an agent 
identified as described herein in an appropriate animal model. For example, an agent identified 
as described herein (e.g., a transporter-modulating agent, an antisense transporter nucleic acid 

20 molecule, a transporter-specific antibody, or a transporter-binding partner) can be used in an 
animal or other model to determine the efficacy, toxicity, or side effects of treatment with such 
an agent. Alternatively, an agent identified as described herein can be used in an animal or other 
model to determine the mechanism of action of such an agent. Furthermore, this invention 
pertains to uses of novel agents identified by the above-described screening assays for treatments 

25 as described herein. 

The transporter proteins of the present invention are also useful to provide a target for 
diagnosing a disease or predisposition to disease mediated by the peptide. Accordingly, the 
invention provides methods for detecting the presence, or levels of, the protein (or encoding 
mRNA) in a cell, tissue, or organism. Experimental data as provided in Figure 1 indicates 
30 expression in humans in brain, heart, kidney, lung, spleen, testis, leukocyte and fetal brain. The 
method involves contacting a biological sample with a compound capable of interacting with the 
transporter protein such that the interaction can be detected. Such an assay can be provided in a 
single detection format or a multi-detection format such as an antibody chip array. 

28 



:SDOCID: <WO_0233086A2J_> 



WO 02/33086 PCT/US01/32152 

One agent for detecting a protein in a sample is an antibody capable of selectively binding to 
protein. A biological sample includes tissues, cells and biological fluids isolated from a subject, as 
well as tissues, cells and fluids present within a subject. 

The peptides of the present invention also provide targets for diagnosing active protein 
5 activity, disease, or predisposition to disease, in a patient having a variant peptide, particularly 
activities and conditions that are known for other members of the family of proteins to which the 
present one belongs. Thus, the peptide can be isolated from a biological sample and assayed for the 
presence of a genetic mutation that results in aberrant peptide. This includes amino acid 
substitution, deletion, insertion, rearrangement, (as the result of aberrant splicing events), and 

1 0' inappropriate post-translational modification. Analytic methods include altered electrophoretic 
mobility, altered tryptic peptide digest, altered transporter activity in cell-based or cell-free assay, 
alteration in ligand or antibody-binding pattern, altered isoelectric point, direct amino acid 
sequencing, and any other of the known assay techniques useful for detecting mutations in a protein. 
Such an assay can be provided in a single detection format or a multi-detection format such as an 

1 5 antibody chip array. 

In vitro techniques for detection of peptide include enzyme linked immunosorbent assays 
(ELIS As), Western blots, immunoprecipitations and immunofluorescence using a detection reagent, 
such as an antibody or protein binding agent. Alternatively, the peptide can be detected in vivo in a 
subject by introducing into the subject a labeled anti-peptide antibody or other types of detection 

20 agent. For example, the antibody can be labeled with a radioactive marker whose presence and 
location in a subject can be detected by standard imaging techniques. Particularly useful are 
methods that detect the allelic variant of a peptide expressed in a subject and methods which detect 
fragments of a peptide in a sample. 

The peptides are also useful in phaimacogenomic analysis. Pharmacogenomics deal with 

25 clinically significant hereditary variations in the response to drugs due to altered drug disposition 

and abnormal action in affected persons. See, e.g., Eichelbaum, M. (Clin Exp. Pharmacol, Physiol. 
23(10-1 l):983-985 (1996)), and Under, M.W. (Clin Chem. 43(2):254-266 (1997)). The clinical 
outcomes of these variations result in severe toxicity of therapeutic drugs in certain individuals or 
therapeutic failure of drugs in certain individuals as a result of individual variation in metabolism. 

30 Thus, the genotype of the individual can determine the way a therapeutic compound acts on the 
body or the way the body metabolizes the compound. Further, the activity of drug metabolizing 
enzymes effects both the intensity and duration of drug action. Thus, the pharmacogenomics of the 
individual permit the selection of effective compounds and effective dosages of such compounds for 
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prophylactic or therapeutic treatment based on the individual's genotype. The discovery of genetic 
polymorphisms in some drug metabolizing enzymes has explained why some patients do not obtain 
the expected drug effects, show an exaggerated drug effect, or experience serious toxicity from 
standard drug dosages. Polymorphisms can be expressed in the phenotype of the extensive 
5 metabolizer and the phenotype of the poor metabolizer. Accordingly, genetic polymorphism may 
lead to allelic protein variants of the transporter protein in which one or more of the transporter 
functions in one population is different from those in another population. The peptides thus allow a 
target to ascertain a genetic predisposition that can affect treatment modality. Thus, in a ligand- 
based treatment, polymorphism may give rise to amino terminal extracellular domains and/or other 

1 0 ligand-binding regions that are more or less active in ligand binding, and transporter activation. 
Accordingly, ligand dosage would necessarily be modified to maximize the therapeutic effect 
within a given population containing a polymorphism. As an alternative to genotyping, specific 
polymorphic peptides could be identified. 

The peptides are also useful for treating a disorder characterized by an absence of, 

1 5 inappropriate, or unwanted expression of the protein. Experimental data as provided in Figure 1 

indicates expression in humans in brain, heart, kidney, lung, spleen, testis, leukocyte and fetal brain. 
Accordingly, methods for treatment include the use of the transporter protein or fragments. 



Antibodies 

20 The invention also proyides antibodies that selectively bind to one of the peptides of the 

present invention, a protein comprising such a peptide, as well as variants and fragments thereof. 
As used herein, an antibody selectively binds a target peptide when it binds the target peptide and 
does not significantly bind to unrelated proteins. An antibody is still considered to selectively bind 
a peptide even if it also binds to other proteins that are not substantially homologous with the target 

25 peptide so long as such proteins share homology with a fragment or domain of the peptide target of 
the antibody. In this case, it would be understood that antibody binding to the peptide is still 
selective despite some degree of cross-reactivity. 

As used herein, an antibody is defined in terms consistent with that recognized within the 
art: they are multi-subunit proteins produced by a mammalian organism in response to an antigen 

30 challenge. The antibodies of the present invention include polyclonal antibodies and monoclonal 
antibodies, as well as fragments of such antibodies, including, but not limited to, Fab or F(ab'>2, and 
Fv fragments. 
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Many methods are known for generating and/or identifying antibodies to a given target 
peptide. Several such methods are described by Harlow, Antibodies, Cold Spring Harbor Press, 
(1989). 

In general, to generate antibodies, an isolated peptide is used as an immunogen and is 
5 administered to a mammalian organism, such as a rat, rabbit or mouse. The full-length protein, an 
antigenic peptide fragment or a fusion protein can be used. Particularly important fragments are 
those covering functional domains, such as the domains identified in Figure 2, and domain of 
sequence homology or divergence amongst the family, such as those that can readily be identified 
using protein alignment methods and as presented in the Figures. 

10 Antibodies are preferably prepared from regions or discrete fragments of the transporter 

proteins. Antibodies can be prepared from any region of the peptide as described herein. 
However, preferred regions will include those involved in function/activity and/or 
transporter/binding partner interaction. Figure 2 can be used to identify particularly important 
regions while sequence alignment can be used to identify conserved and unique sequence 

15 fragments. 

An antigenic fragment will typically comprise at least 8 contiguous amino acid residues. 
The antigenic peptide can comprise, however, at least 10, 12, 14, 16 or more amino acid residues. 
Such fragments can be selected on a physical property, such as fragments correspond to regions that 
are located on the surface of the protein, e.g., hydrophilic regions or can be selected based on 

20 sequence uniqueness (see Figure 2). 

Detection on an antibody of the present invention can be facilitated by coupling (i.e., 
physically linking) the antibody to a detectable substance. Examples of detectable substances 
include various enzymes, prosthetic groups, fluorescent materials, luminescent materials, 
bioluminescent materials, and radioactive materials. Examples of suitable enizymes include 

25 horseradish peroxidase, alkaline phosphatase, p-galactosidase, or acetylcholinesterase; examples of 
suitable prosthetic group complexes include streptavidin/biotin and avidin/biotin; examples of 
suitable fluorescent materials include umbelliferone, fluorescein, fluorescein isothiocyanate, 
rhodamine, dichlorotriazinylamine fluorescein, dansyl chloride or phycoerythrin; an example of a 
luminescent material includes luminol; examples of bioluminescent materials include luciferase, 

30 luciferin, and aequorin, and examples of suitable radioactive material include 125 1, 13 1 I, 35 S or 3 H. 
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Antibody Uses 

The antibodies can be used to isolate one of the proteins of the present invention by standard 
techniques, such as affinity chromatography or immunoprecipitation. The antibodies can facilitate 
the purification of the natural protein from cells and recombinantly produced protein expressed in 
5 host cells. In addition, such antibodies are useful to detect the presence of one of the proteins of the 
present invention in cells or tissues to determine the pattern of expression of the protein among 
various tissues in an organism and over the course of normal development. Experimental data as 
provided in Figure 1 indicates that sodium/calcium exchanger proteins of the present invention are 
expressed in humans in the heart, retina, kidney, fetal brain, and fetal heart. Specifically, a virtual 

1 0 northern blot shows expression in the fetal brain. In addition, PCR-based tissue screening panel 
indicates expression in brain, heart, kidney, lung, spleen, testis, leukocyte and fetal brain. Further, 
such antibodies can be used to detect protein in situ, in vitro, or in a cell lysate or supernatant in 
order to evaluate the abundance and pattern of expression. Also, such antibodies can be used to 
assess abnormal tissue distribution or abnormal expression during development or progression of a 

1 5 biological condition. Antibody detection of circulating fragments of the full length protein can be 
used to identify turnover. 

Further, the antibodies can be used to assess expression in disease states such as in active 
stages of the disease or in an individual with a predisposition toward disease related to the protein's 
function. When a disorder is caused by an inappropriate tissue distribution, developmental 

20 expression, level of expression of the protein, or expressed/processed form, the antibody can be 

prepared against the normal protein. Experimental data as provided in Figure 1 indicates expression 
in humans in brain, heart, kidney, lung, spleen, testis, leukocyte and fetal brain. If a disorder is 
characterized by a specific mutation in the protein, antibodies specific for this mutant protein can be 
used to assay for the presence of the specific mutant protein. 

25 The antibodies can also be used to assess normal and aberrant subcellular localization of 

cells in the various tissues in an organism. Experimental data as provided in Figure 1 indicates 
expression in humans in brain, heart, kidney, lung, spleen, testis, leukocyte and fetal brain. The 
diagnostic uses can be applied, not only in genetic testing, but also in monitoring a treatment 
modality. Accordingly, where treatment is ultimately aimed at correcting expression level or the 

30 presence of aberrant sequence and aberrant tissue distribution or developmental expression, 

antibodies directed against the protein or relevant fragments can be used to monitor therapeutic 
efficacy. 
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Additionally, antibodies are useful in pharmacogenomic analysis. Thus, antibodies prepared 
against polymorphic proteins can be used to identify individuals that require modified treatment 
modalities. The antibodies are also useful as diagnostic tools as an immunological marker for 
aberrant protein analyzed by electrophoretic mobility, isoelectric point, tryptic peptide digest, and 
5 other physical assays known to those in the art. 

The antibodies are also useful for tissue typing. Experimental data as provided in Figure 1 
indicates expression in humans in brain, heart, kidney, lung, spleen, testis, leukocyte and fetal brain. 
Thus, where a specific protein has been correlated with expression in a specific tissue, antibodies 
that are specific for this protein can be used to identify a tissue type. 

1 0 The antibodies are also useful for inhibiting protein function, for example, blocking the 

binding of the transporter peptide to a binding partner such as a ligand or protein binding partner. 
These uses can also be applied in a therapeutic context in which treatment involves inhibiting the 
protein's function. An antibody can be used, for example, to block binding, thus modulating 
(agonizing or antagonizing) the peptides activity. Antibodies can be prepared against specific 

1 5 fragments containing sites required for function or against intact protein that is associated with a cell 
or cell membrane. See Figure 2 for structural information relating to the proteins of the present 
invention. 

The invention also encompasses kits for using antibodies to detect the presence of a protein 
in a biological sample. The kit can comprise antibodies such as a labeled or labelable antibody and 

20 a compound or agent for detecting protein in a biological sample; means for determining the amount 
of protein in the sample; means for comparing the amount of protein in the sample with a standard; 
and instructions for use. Such a kit can be supplied to detect a single protein or epitope or can be 
configured to detect one of a multitude of epitopes, such as in an antibody detection array. Arrays 
are described in detail below for nucleic acid arrays and similar methods have been developed for 

25 antibody arrays. 

Nucleic Acid Molecules 

The present invention further provides isolated nucleic acid molecules that encode a 
transporter peptide or protein of the present invention (cDNA, transcript and genomic sequence). 
30 Such nucleic acid molecules will consist of, consist essentially of, or comprise a nucleotide 

sequence that encodes one of the transporter peptides of the present invention, an allelic variant 
thereof, or an ortholog or paralog thereof. 
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,-. As used herein, an "isolated" nucleic acid molecule is one that is separated from other 
nucleic acid present in the natural source of the nucleic acid. Preferably, an "isolated" nucleic acid 
is free of sequences that naturally flank the nucleic acid (i.e., sequences located at the 5' and 3' ends 
of the nucleic acid) in the genomic DNA of the organism from which the nucleic acid is derived. 
5 However, there can be some flanking nucleotide sequences, for example up to about 5KB, 4KB, 
3KB, 2KB, or 1 KB or less, particularly contiguous peptide encoding sequences and peptide 
encoding sequences within the same gene but separated by introns in the genomic sequence. The 
important point is that the nucleic acid is isolated from remote and unimportant flanking sequences 
such that it can be subjected to the specific manipulations described herein such as recombinant 

1 0 expression, preparation of probes and primers, and other uses specific to the nucleic acid sequences. 

Moreover, an "isolated" nucleic acid molecule, such as a transcript/cDNA molecule can be 
substantially free of other cellular material, or culture medium when produced by recombinant 
techniques, or chemical precursors or other chemicals when chemically synthesized. However, the 
nucleic acid molecule can be fused to other coding or regulatory sequences and still be considered 

1 5 isolated. 

For example, recombinant DNA molecules contained in a vector are considered isolated. 
Further examples of isolated DNA molecules include recombinant DNA molecules maintained in 
heterologous host cells or purified (partially or substantially) DNA molecules in solution. Isolated 
RNA molecules include in vivo or in vitro RNA transcripts of the isolated DNA molecules of the 

20 present invention. Isolated nucleic acid molecules according to the present invention further include 
such molecules produced synthetically. 

Accordingly, the present invention provides nucleic acid molecules that consist of the 
nucleotide sequence shown in Figure 1 or 3 (SEQ ID NO:l, transcript sequence and SEQ ID NO:3, 
genomic sequence), or any nucleic acid molecule that encodes the protein provided in Figure 2, 

25 SEQ ID NO:2. A nucleic acid molecule consists of a nucleotide sequence when the nucleotide 
sequence is the complete nucleotide sequence of the nucleic acid molecule. 

The present invention further provides nucleic acid molecules that consist essentially of the 
nucleotide sequence shown in Figure 1 or 3 (SEQ ID NO :1, transcript sequence and SEQ ID NO:3, 
genomic sequence), or any nucleic acid molecule that encodes the protein provided in Figure 2, 

30 SEQ ID NO:2. A nucleic acid molecule consists essentially of a nucleotide sequence when such a 
nucleotide sequence is present with only a few additional nucleic acid residues in the final nucleic 
acid molecule. 
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The present invention further provides nucleic acid molecules that comprise the nucleotide 
sequences shown in Figure 1 or 3 (SEQ ID NO :1, transcript sequence and SEQ ID NO:3, genomic 
sequence), or any nucleic acid molecule that encodes the protein provided in Figure 2, SEQ ED 
NO:2. A nucleic acid molecule comprises a nucleotide sequence when the nucleotide sequence is at 

5 least part of the final nucleotide sequence of the nucleic acid molecule. In such a fashion, the 
. nucleic acid molecule can be only the nucleotide sequence or have additional nucleic acid residues, 
such as nucleic acid residues that are naturally associated with it or heterologous nucleotide 
sequences. Such a nucleic acid molecule can have a few additional nucleotides or can comprise 
several hundred or more additional nucleotides. A brief description of how various types of these 

1 0 nucleic acid molecules can be readily made/isolated is provided below. 

In Figures 1 and 3, both coding and non-coding sequences are provided. Because of the 
source of the present invention, humans genomic sequence (Figure 3) and cDNA/transcript 
sequences (Figure 1), the nucleic acid molecules in the Figures will contain genomic intronic 
sequences, 5' and 3' non-coding sequences, gene regulatory regions and non-coding intergenic 

15 sequences. In general such sequence features are either noted in Figures 1 and 3 or can readily 
be identified using computational tools known in the art. As discussed below, some of the non- 
coding regions, particularly gene regulatory elements such as promoters, are useful for a variety 
of purposes, e.g. control of heterologous gene expression, target for identifying gene activity 
modulating compounds, and are particularly claimed as fragments of the genomic sequence 

20 provided herein. 

The isolated nucleic acid molecules can encode the mature protein plus additional amino or 
carboxyl-terminal amino acids, or amino acids interior to the mature peptide (when the mature form 
has more than one peptide chain, for instance). Such sequences may play a role in processing of a 
protein from precursor to a mature form, facilitate protein trafficking, prolong or shorten protein 

25 half-life or facilitate manipulation of a protein for assay or production, among other things. As 
generally is the case in situ, the additional amino acids may be processed away from the mature 
protein by cellular enzymes. 

As mentioned above, the isolated nucleic acid molecules include, but are not limited to, the 
sequence encoding the transporter peptide alone, the sequence encoding the mature peptide and 

30 additional coding sequences, such as a leader or secretory sequence (e.g., a pre-pro or pro-protein 
sequence), the sequence encoding the mature peptide, with or without the additional coding 
sequences, plus additional non-coding sequences, for example introns and non-coding 5' and 3' 
sequences such as transcribed but non-translated sequences that play a role in transcription, mRNA 
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processing (including splicing and polyadenylation signals), ribosome binding and stability of 
mRNA. In addition, the nucleic acid molecule may be fused to a marker sequence encoding, for 
example, a peptide that facilitates purification. 

Isolated nucleic acid molecules can be in the form of RNA, such as mRNA, or in the form 
DNA, including cDNA and genomic DNA obtained by cloning or produced by chemical synthetic 
techniques or by a combination thereof. The nucleic acid, especially DNA, can be double-stranded 
or single-stranded. Single-stranded nucleic acid can be the coding strand (sense strand) or the non- 
coding strand (anti-sense strand). 

The invention further provides nucleic acid molecules that encode fragments of the peptides 
of the present invention as well as nucleic acid molecules that encode obvious variants of the 
transporter proteins of the present invention that are described above. Such nucleic acid molecules 
may be naturally occurring, such as allelic variants (same locus), paralogs (different locus), and ' 
orthologs (different organism), or may be constructed by recombinant DNA methods or by 
chemical synthesis. Such non-naturally occurring variants may be made by mutagenesis 
techniques, including those applied to nucleic acid molecules, ceUs, or organisms. Accordingly, as 
discussed above, the variants can contain nucleotide substitutions, deletions, inversions and 
insertions. Variation can occur in either or both the coding and non-coding regions. The variations 
can produce both conservative and non-conservative amino acid substitutions. 

The present invention further provides non-coding fragments of the nucleic acid molecules 
provided in Figures 1 and 3. Preferred non-coding fragments include, but are not limited to, 
promoter sequences, enhancer sequences, gene modulating sequences and gene termination 
sequences. Such fragments are useful in controlling heterologous gene expression and in 
developing screens to identify gene-modulating agents. A promoter can readily be identified as 
being 5 J tothe ATG start site in the genomic sequence provided in Figure 3. 

A fragment comprises a contiguous nucleotide sequence greater than 12 or more 
nucleotides. Further, a fragment could at least 30, 40, 50, 100, 250 or 500 nucleotides in length. 
The length of the fragment will be based on its intended use. For example, the fragment can encode 
epitope bearing regions of the peptide, or can be useful as DNA probes and primers. Such 
fragments can be isolated using the known nucleotide sequence to synthesize an oligonucleotide 
probe. A labeled probe can then be used to screen a cDNA library, genomic DNA library, or 
mRNA to isolate nucleic acid corresponding to the coding region. Further, primers can be used in 
PCR reactions to clone specific regions of gene. 
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A probe/primer typically comprises substantially a purified oligonucleotide or 
oligonucleotide pair. The oligonucleotide typically comprises a region of nucleotide sequence that 
hybridizes under stringent conditions to at least about 12, 20, 25, 40, 50 or more consecutive 
nucleotides. 

5 Orthologs, homologs, and allelic variants can be identified using methods well known in the 

art. As described in the Peptide Section, these variants comprise a nucleotide sequence encoding a 
peptide that is typically 60-70%, 70-80%, 80-90%, and more typically at least about 90-95% or 
more homologous to the nucleotide sequence shown in the Figure sheets or a fragment of this 
sequence. Such nucleic acid molecules can readily be identified as being able to hybridize under 

10 moderate to stringent conditions, to the nucleotide sequence shown in the Figure sheets or a 
fragment of the sequence. Allelic variants can readily be determined by genetic locus of the 
encoding gene. As indicated by the data presented in Figure 3, the map position was determined to 
be on chromosome 1 4 by ePCR. 

Figure 3 provides information on SNPs that have been identified in a gene encoding the 

15 transporter protein of the present inv ention. 140 SNP variants were found, including 6 indels 

(indicated by a and 1 SNPs in exons. The others were found in in introns and regions 5' and 
3' of the ORF. Such SNPs in introns and outside the ORF may affect control/regulatory 
elements. 

As used herein, the term "hybridizes under stringent conditions" is intended to describe 
20 conditions for hybridization and washing under which nucleotide sequences encoding a peptide at 
least 60-70% homologous to each other typically remain hybridized to each other. The conditions 
can be such that sequences at least about 60%, at least about 70%, or at least about 80% or more 
homologous to each other typically remain hybridized to each other. Such stringent conditions are 
known to those skilled in the art and can be found in Current Protocols in Moleculai' Biology, John 
25 Wiley & Sons, N.Y. (1989), 6.3.1-6.3.6. One example of stringent hybridization conditions are 
hybridization in 6X sodium chloride/sodium citrate (SSC) at about 45C, followed by one or more 
washes in 0.2 X SSC, 0. 1% SDS at 50-65C. Examples of moderate to low stringency hybridization 
conditions are well known in the art. 



30 Nucleic Acid Molecule Uses 

The nucleic acid molecules of the present invention are useful for probes, primers, chemical 
intermediates, and in biological assays. The nucleic acid molecules are useful as a hybridization 
probe for messenger RNA, transcript/cDNA and genomic DNA to isolate full-length cDNA and 
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genomic clones encoding the peptide described in Figure 2 and to isolate cDNA and genomic 
clones that correspond to variants (alleles, orthologs, etc.) producing the same or related peptides 
shown in Figure 2. 140 SNPs, including 6 indels, have been identified in the gene encoding the 
transporter protein provided by the present invention and are given in Figure 3. 
5 The probe can correspond to any sequence along the entire length of the nucleic acid 

molecules provided in the Figures. Accordingly, it could be derived from 5' noncoding regions, the 
coding region, and 3' noncoding regions. However, as discussed, fragments are not to be construed 
as encompassing fragments disclosed prior to the present invention. 

The nucleic acid molecules are also useful as primers for PCR to amplify any given region 
10 of a nucleic acid molecule and are useful to synthesize antisense molecules of desired length and 
sequence. 

The nucleic acid molecules are also useful for constructing recombinant vectors. Such 
vectors include expression vectors that express a portion of, or all of, the peptide sequences. 
Vectors also include insertion vectors, used to integrate into another nucleic acid molecule 
1 5 sequence, such as into the cellular genome, to alter in situ expression of a gene and/or gene product. 
For example, an endogenous coding sequence can be replaced via homologous recombination with 
all or part of the coding region containing one or more specifically introduced mutations. 

The nucleic acid molecules are also useful for expressing antigenic portions of the proteins. 

The nucleic acid molecules are also useful as probes for determining the chromosomal 
20 positions of the nucleic acid molecules by means of in situ hybridization methods. As indicated by 
the data presented in Figure 3, the map position was determined to be on chromosome 14 by ePCR. 

The nucleic acid molecules are also useful in making vectors containing the gene regulatory 
regions of the nucleic acid molecules of the present invention. 

The nucleic acid molecules are also useful for designing ribozymes corresponding to all, or 
25 a part, of the mRNA produced from the nucleic acid molecules described herein. 

The nucleic acid molecules are also useful for making vectors that express part, or all, of the 
peptides. 

The nucleic acid molecules are also useful for constructing host cells expressing a part, or 
all, of the nucleic acid molecules and peptides. 
30 The nucleic acid molecules are also useful for constructing transgenic animals expressing 

all, or a part, of the nucleic acid molecules and peptides. 

The nucleic acid molecules are also useful as hybridization probes for determining the 
presence, level, form and distribution of nucleic acid expression. Experimental data as provided in 

38 

JSOOCIO < WO 0233086A2J _> 



WO 02/33086 



PCT/US01/32152 



Figure 1 indicates that sodium/calcium exchanger proteins of the present invention are expressed in 
humans in the heart, retina, kidney, fetal brain, and fetal heart. Specifically, a virtual northern blot 
shows expression in the fetal brain. In addition, PCR-based tissue screening panel indicates 
expression in brain, heart, kidney, lung, spleen, testis, leukocyte and fetal brain. 
5 Accordingly, the probes can be used to detect the presence of, or to determine levels of, a 

specific nucleic acid molecule in cells, tissues, and in organisms. The nucleic acid whose level is 
determined can be DNA or RNA. Accordingly, probes corresponding to the peptides described 
herein can be used to assess expression and/or gene copy number in a given cell, tissue, or 
organism. These uses are relevant for diagnosis of disorders involving an increase or decrease in 
1 0 transporter protein expression relative to normal results. 

In vitro techniques for detection of mRN A include Northern hybridizations and in situ 

. * * - s 

hybridizations. In vitro techniques for detecting DNA include Southern hybridizations and in situ 
hybridization. 

Probes can be used as a part of a diagnostic test kit for identifying cells or tissues that 

1 5 express a transporter protein, such as by measuring a level of a transporter-encoding nucleic acid in 
a sample of cells from a subject e.g., mRNA or genomic DNA, or determining if a transporter gene 
has been mutated. Experimental data as provided in Figure 1 indicates that sodium/calcium 
exchanger proteins of the present invention are expressed in humans in the heart, retina, kidney, 
fetal brain, and fetal heart. Specifically, a virtual northern blot shows expression in the fetal brain. 

20 In addition, PCR-based tissue screening panel indicates expression in brain, heart, kidney, lung, 
spleen, testis, leukocyte and fetal brain. 

Nucleic acid expression assays are useful for drug screening to identify compounds that 
modulate transporter nucleic acid expression: 

The invention thus provides a method for identifying a compound that can be used to treat a 

25 disorder associated with nucleic acid expression of the transporter gene, particularly biological and 
pathological processes that are mediated by the transporter in cells and tissues that express it. 
Experimental data as provided in Figure 1 indicates expression in humans in brain, heart, kidney, 
lung, spleen, testis, leukocyte and fetal brain. The method typically includes assaying the ability of 
the compound to modulate the expression of the transporter nucleic acid and thus identifying a 

30 compound that can be used to treat a disorder characterized by undesired transporter nucleic acid 
expression. The assays can be performed in cell-based and cell-free systems. Cell-based assays 
include cells naturally expressing the transporter nucleic acid or recombinant cells genetically 
engineered to express specific nucleic acid sequences. 
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The assay for transporter nucleic acid expression can involve direct assay of nucleic acid 
levels, such as mRNA levels, or on collateral compounds involved in the signal pathway. Further, 
the expression of genes that are up- or down-regulated in response to the transporter protein signal 
pathway can also be assayed. In this embodiment the regulatory regions of these genes can be. 
5 operably linked to a reporter gene such as luciferase. 

Thus, modulators of transporter gene expression can be identified in a method wherein a cell 
is contacted with a candidate compound and the expression of mRNA determined. The level of - 
expression of transporter mRNA in the presence of the candidate compound is compared to the 
level of expression of transporter mRNA in the absence of the candidate compound. The candidate 
1 0 compound can tlien be identified as a modulator of nucleic acid expression based on this 

comparison and be used, for example to treat a disorder characterized by aberrant nucleic acid 
expression. When expression of mRNA is statistically significantly greater in the presence of the 
candidate compound tlian in its absence, the candidate compound is identified as a stimulator of 
nucleic acid expression. When nucleic acid expression is statistically significantly less in the 
1 5 presence of the candidate compound than in its absence, the candidate compound is identified as an 
inhibitor of nucleic acid expression. 

The invention further provides methods of treatment, with the nucleic acid as a target, using 
a compound identified through drug screening as a gene modulator to modulate transporter nucleic 
acid expression in cells and tissues that express the transporter. Experimental data as provided in 
20 Figure 1 indicates that sodium/calcium exchanger proteins of the present invention are expressed in 
humans in the heart, retina, kidney, fetal brain, and fetal heart. Specifically, a virtual northern blot 
shows expression in the fetal brain. In addition, PCR-based tissue screening panel indicates 
expression in brain, heart, kidney, lung, spleen, testis, leukocyte and fetal brain. Modulation 
includes both up-regulation (i.e. activation or agonization) or down-regulation (suppression or 
25 antagonization) or nucleic acid expression. 

Alternatively, a modulator for transporter nucleic acid expression can be a small molecule or 
drug identified using the screening assays described herein as long as the drug or small molecule 
inhibits the transporter nucleic acid expression in the cells and tissues that express the protein. 
Experimental data as provided in Figure 1 indicates expression in humans in brain, heart, kidney, 
30 lung, spleen, testis, leukocyte and fetal brain. 

The nucleic acid molecules are also useful for monitoring the effectiveness of modulating 
compounds on the expression or activity of the transporter gene in clinical trials or in a treatment 
regimen. Thus, the gene expression pattern can serve as a barometer for the continuing 
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effectiveness of treatment with the compound, particularly with compounds to which a patient can 
develop resistance. The gene expression pattern can also serve as a marker indicative of a 
physiological response of the affected cells to the compound. Accordingly, such monitoring would 
allow either increased administration of the compound or the administration of alternative 
5 compounds to which the patient has not become resistant. Similarly, if the level of nucleic acid 
expression falls below a desirable level, administration of the compound could be commensurately 
decreased. ) 

The nucleic acid molecules are also useful in diagnostic assays for qualitative changes in 
transporter nucleic acid expression, and particularly in qualitative changes that lead to pathology. 
. 1 0 The nucleic acid molecules can be used to detect mutations in transporter genes and gene expression 
products such as mRNA. The nucleic acid molecules can be used as hybridization probes to detect 
naturally occurring genetic mutations in the transporter gene and thereby to determine whether a 
subject with the mutation is at risk for a disorder caused by the mutation. Mutations include 
deletion, addition, or substitution of one or more nucleotides in the gene, chromosomal 

1 5 rearrangement, such as inversion or transposition, modification of genomic DNA, such as aberrant 
methylation patterns or changes in gene copy number, such as amplification. Detection of a 
mutated form of the transporter gene associated with a dysfunction provides a diagnostic tool for an 
active disease or susceptibility to disease when the disease results from overexpression, 
underexpression, or altered expression of a transporter protein. 

20 Individuals carrying mutations in the transporter gene can be detected at the nucleic acid 

level by a variety of techniques. Figure 3 provides information on SNPs that have been identified in 
a gene encoding the transporter protein of the present invention. 140 SNP variants were found, 
including 6 indels (indicated by a "-") and 1 SNPs in exons. The others were found in in introns 
and regions 5' and 3' of the ORF. Such SNPs in introns and outside the ORF may affect 

25 control/regulatory elements. As indicated by the data presented in Figure 3, the map position was 
determined to be on chromosome 14 by ePCR. Genomic DNA can be analyzed directly or can be 
amplified by using PCR prior to analysis. RNA or cDNA can be used in the same way. In some 
uses, detection of the mutation involves the use of a probe/primer in a polymerase chain reaction 
(PCR) (see, e.g. U.S. Patent Nos. 4,683, 1 95 and 4,683,202), such as anchor PCR or RACE PCR, or, 

30 alternatively, in a ligation chain reaction (LCR) (see, e.g., Landegran et al, Science 247:1077-1080 
(1988); and Nakazawa et aL, PNAS 97:360-364 (1994)), the latter of which can be particularly 
useful for detecting point mutations in the gene (see Abravaya et al , Nucleic Acids Res. 23:675-682 
(1995)). This method can include the steps of collecting a sample of cells from a patient, isolating 
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nucleic acid (e.g., genomic, mRNA or both) from the cells of the sample, contacting the nucleic acid 
sample with one or more primers which specifically hybridize to a gene under conditions such that 
hybridization and amplification of the gene (if present) occurs, and detecting the presence or 
absence of an amplification product, or detecting the size of the amplification product and 
5 comparing the length to a control sample. Deletions and insertions can be detected by a change in 
size of the amplified product compared to the normal genotype. Point mutations can be identified 
by hybridizing amplified DNA to normal RNA or antisense DNA sequences. 

Alternatively, mutations in a transporter gene can be directly identified, for example, by 
alterations in restriction enzyme digestion patterns determined by gel electrophoresis. 

1 0 Further, sequence-specific ribozymes (U.S. Patent No. 5,498,53 1) can be used to score for 

the presence of specific mutations by development or loss of a ribozyme cleavage site. Perfectly 
matched sequences can be distinguished from mismatched sequences by nuclease cleavage 
digestion assays or by differences in melting temperature. 

Sequence changes at specific locations can also be assessed by nuclease protection assays 

1 5 such as RNase and SI protection or the chemical cleavage method. Furthermore, sequence 

differences between a mutant transporter gene and a wild-type gene can be determined by direct 
DNA sequencing. A variety of automated sequencing procedures can be utilized when performing 
the diagnostic assays (Naeve, C.W., (1995) Biotechniques 7P:448), including sequencing by mass 
spectrometry (see, e.g., PCT International Publication No. WO 94/16101; Cohen et al,Adv. 

20 Chromatogr. 35:127-162 (1996); and Griffin et al,Appl. Biochem. Biotechnol. 35:147-159 (1993)). 

Other methods for detecting mutations in the gene include methods in which protection 
from cleavage agents is used to detect mismatched bases in RNA/RNA or RNA/DNA duplexes 
(Myers et al, Science 230:1242 (1985)); Cotton etal., PNAS 85:4397 (1988); Saleeba et al , Meth 
Enzymol 2/7:286-295 (1992)), electrophoretic mobility of mutant and wild type nucleic acid is 

25 compared (Orita et al, PNAS 86:2766 (1989); Cotton etal., Mutat. Res. 255:125-144 (1993); and 
Hayashi et al, Genet. Anal. Tech. Appl. P:73-79 (1992)), and movement of mutant or wild-type 
fragments in polyacrylamide gels containing a gradient of denaturant is assayed using denaturing 
gradient gel electrophoresis (Myers et al, Nature 373:495 (1985)). Examples of other techniques 
for detecting point mutations include selective oligonucleotide hybridization, selective 

30 amplification, and selective primer extension. 

The nucleic acid molecules are also useful for testing an individual for a genotype that while 
not necessarily causing the disease, nevertheless affects the treatment modality. Thus, the nucleic 
acid molecules can be used to study the relationship between an individual's genotype and the 
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individual's response to a compound used for treatment (pharmacogenomic relationship). 
Accordingly, the nucleic acid molecules described herein can be used to assess the mutation content 
of the transporter gene in an individual in order to select an appropriate compound or dosage 
regimen for treatment. Figure 3 provides information on SNPs that have been identified in a gene 
5 encoding the transporter protein of the present invention. 140 SNP variants were found, including 6 
indels (indicated by a "-") and 1 SNPs in exons. The others were found in in introns and regions 5' 
and 3' of the ORF. Such SNPs in introns and outside the ORF may affect control/regulatory 
elements. 

Thus nucleic acid molecules displaying genetic variations that affect treatment provide a 

1 0 diagnostic target that can be used to tailor treatment in an individual. Accordingly, the production 
of recombinant cells and animals containing these polymorphisms allow effective clinical design of 
treatment compounds and dosage regimens. 

The nucleic acid molecules are thus useful as antisense constructs to control transporter gene 
expression in cells, tissues, and organisms. A DNA antisense nucleic acid molecule is designed to 

15 be complementary to a region of the gene involved in transcription, preventing transcription and 
hence production of transporter protein. An antisense RNA or DNA nucleic acid molecule would 
hybridize to the mRNA and thus block translation of mRNA into transporter protein. 

Alternatively, a class of antisense molecules can be used to inactivate mRNA in order to 
decrease expression of transporter nucleic acid. Accordingly, these molecules can treat a disorder 

20 characterized by abnormal or undesired transporter nucleic acid expression. This technique 

involves cleavage by means of ribozymes containing nucleotide sequences complementary to one or 
more regions in the mRNA that attenuate the ability of the mRNA to be translated. Possible regions 
include coding regions and particularly coding regions corresponding to the catalytic and other 
functional activities of the transporter protein, such as ligand binding. 

25 The nucleic acid molecules also provide vectors for gene therapy in patients containing cells 

that are aberrant in transporter gene expression. Thus, recombinant cells, which include the patient's 
cells that have been engineered ex vivo and returned to the patient, are introduced into an individual 
where the cells produce the desired transporter protein to treat the individual. 

The invention also encompasses kits for detecting the presence of a transporter nucleic acid 

30 in a biological sample. Experimental data as provided in Figure 1 indicates that sodium/calcium 
exchanger proteins of the present invention are expressed in humans in the heart, retina, kidney, 
fetal brain, and fetal heart. Specifically, a virtual northern blot shows expression in the fetal brain. 
In addition, PCR-based tissue screening panel indicates expression in brain, heart, kidney, lung, 
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spleen, testis, leukocyte and fetal brain. For example, the kit can comprise reagents such as a 
labeled or labelable nucleic acid or agent capable of detecting transporter nucleic acid in a 
biological sample; means for determining the amount of transporter nucleic acid in the sample; and 
means for comparing the amount of transporter nucleic acid in the sample with a standard. The 
5 compound or agent can be packaged in a suitable container. The kit can further comprise 
instructions for using the kit to detect transporter protein mRNA or DNA. 

Nucleic Acid Array s 

The present invention further provides nucleic acid detection kits, such as arrays or 

1 0 microarrays of nucleic acid molecules that are based on the sequence information provided in 
Figures 1 and 3 (SEQ ID NOS: 1 and 3). . ✓ 

As used herein "Arrays" or "Microarrays" refers to an array of distinct polynucleotides or 
oligonucleotides synthesized on a substrate, such as paper, nylon or other type of membrane, 
filter, chip, glass slide, or any other suitable solid support. In one embodiment, the microarray is 

15 prepared and used according to the methods described in US Patent 5,837,832, Chee et al, PCT 
application W095/1 1995 (Chee et al\ Lockhart, D. J. et al (1996; Nat. Biotech. 14: 1675-1680) 
and Schena, M. et al (1996; Proc. Natl. Acad. Sci. 93: 10614-10619), all of which are 
incorporated herein in their entirety by reference. In other embodiments, such arrays are 
produced by the methods described by Brown et al 7 US Patent No. 5,807,522. 

20 The microarray or detection kit is preferably composed of a large number of unique, 

single-stranded nucleic acid sequences, usually either synthetic antisense oligonucleotides or 
fragments of cDNAs, fixed to a solid support. The oligonucleotides are preferably about 6-60 
nucleotides in length, more preferably 15-30 nucleotides in length, and most preferably about 20- 
25 nucleotides in length. For a certain type of microarray or detection kit, it may be preferable to 

25 use oligonucleotides that are only 7-20 nucleotides in length. The microarray or detection kit 
may contain oligonucleotides that cover the known 5', or 3\ sequence, sequential 
oligonucleotides that cover the full length sequence; or unique oligonucleotides selected from 
particular areas along the length of the sequence. Polynucleotides used in the microarray or 
detection kit may be oligonucleotides that are specific to a gene or genes of interest. 

30 In order to produce oligonucleotides to a known sequence for a microarray or detection 

kit, the gene(s) of interest (or an ORF identified from the contigs of the present invention) is 
typically examined using a computer algorithm which starts at the 5' or at the 3' end of the 
nucleotide sequence. Typical algorithms will then identify oligomers of defined length that are 
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unique to the gene, have a GC content within a range suitable for hybridization, and lack 
predicted secondary structure that may interfere with hybridization. In certain situations it may 
be appropriate to use pairs of oligonucleotides on a microarray or detection kit. The "pairs" will 
be identical, except for one nucleotide that preferably is located in the center of the sequence. 
5 The second oligonucleotide in the pair (mismatched by one) serves as a control. The number of 
oligonucleotide pairs may range from two to one million. The oligomers are synthesized at 
designated areas on a substrate using a light-directed chemical process. The substrate may be 
paper, nylon or other type of membrane, filter, chip, glass slide or any other suitable solid 
support. 

10 In another aspect, an oligonucleotide may be synthesized on the surface of the substrate 

by using a chemical coupling procedure and an ink jet application apparatus, as described in PCT 
application W095/251 1 16 (Baldeschweiler et al) which is incorporated herein in its entirety by 
reference. In another aspect, a "gridded" array analogous to a dot (or slot) blot may be used to 
arrange and link cDNA fragments or oligonucleotides to the surface of a substrate using a 

15 vacuum system, thermal, UV, mechanical or chemical bonding procedures. An array, such as 
those described above, may be produced by hand or by using available devices (slot blot or dot 
blot apparatus), materials (any suitable solid support), and machines (including robotic 
instruments), and may contain 8, 24, 96, 384, 1536, 6144 or more oligonucleotides, or any other 
number between two and one million which lends itself to the efficient use of commercially 

20 available instrumentation. 

In order to conduct sample analysis using a microarray or detection kit, the RNA or DNA 
from a biological sample is made into hybridization probes. The mRNA is isolated, and cDNA is 
produced and used as a template to make antisense RNA (aRNA). The aRNA is amplified in the 
presence of fluorescent nucleotides, and labeled probes are incubated with the microarray or 

25 detection kit so that the probe sequences hybridize to complementary oligonucleotides of the 

microarray or detection kit. Incubation conditions are adjusted so that hybridization occurs with 
precise complementary matches or with various degrees of less complementarity. After removal 
of nonhybridized probes, a scanner is used to determine the levels and patterns of fluorescence. 
The scanned images are examined to determine degree of complementarity and the relative 

30 abundance of each oligonucleotide sequence on the microarray or detection kit. The biological 
samples may be obtained from any bodily fluids (such as blood, urine, saliva, phlegm, gastric 
juices, etc.), cultured cells, biopsies, or other tissue preparations. A detection system may be 
used to measure the absence, presence, and amount of hybridization for all of the distinct 
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sequences simultaneously. This data may be used for large-scale correlation studies on the 
sequences, expression patterns, mutations, variants, or polymorphisms among samples. 

Using such arrays, the present invention provides methods to identify the expression of 
the transporter proteins/peptides of the present invention. In detail, such methods comprise 
5 incubating a test sample with one or more nucleic acid molecules and assaying for binding of the 
nucleic acid molecule with components within the test sample. Such assays will typically 
involve arrays comprising many genes, at least one of which is a gene of the present invention 
and or alleles of the transporter gene of the present invention. Figure 3 provides information on 
SNPs that have been identified in a gene encoding the transporter protein of the present 

1 0 invention. 1 40 SNP variants were found, including 6 indels (indicated by a "-") and 1 SNPs in 
exons. The others were found in in introns and regions 5' and 3' of the ORF. Such SNPs in 
introns and outside the ORF may affect control/regulatory elements. 

Conditions for incubating a nucleic acid molecule with a test sample vary. Incubation 
conditions depend on the format employed in the assay, the detection methods employed, and the 

1 5 type and nature of the nucleic acid molecule used in the assay. One skilled in the art will 

recognize that any one of the commonly available hybridization, amplification or array assay 
formats can readily be adapted to employ the novel fragments of the Human genome disclosed 
herein. Examples of such assays can be found in Chard, T, An Introduction to 
Radioimmunoassay and Related Techniques, Elsevier Science Publishers, Amsterdam, The 

20 Netherlands (1986); Bullock, G. R. et al, Techniques in Immunocytochemistry, Academic 
Press, Orlando, FL Vol. 1 (1 982), Vol. 2 (1983), Vol. 3 (1985); Tijssen, P., Practice and 
Theory of Enzyme Immunoassays: Laboratory Techniques in Biochemistry and Molecular 
Biology, Elsevier Science Publishers, Amsterdam, The Netherlands (1985). 

The test samples of the present invention include cells, protein or membrane extracts of 

25 cells. The test sample used in the above-described method will vary based on the assay format, 
nature of the detection method and the tissues, cells or extracts used as the sample to be assayed. 
Methods for preparing nucleic acid extracts or of cells are well known in the art and can be 
readily be adapted in order to obtain a sample that is compatible with the system utilized. 

In another embodiment of the present invention, kits are provided which contain the 

30 necessary reagents to carry out the assays of the present invention. 

Specifically, the invention provides a compartmentalized kit to receive, in close 
confinement, one or more containers which comprises: (a) a first container comprising one of the 
nucleic acid molecules that can bind to a fragment of the Human genome disclosed herein; and 
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(b) one or more other containers comprising one or more of the following: wash reagents, 
reagents capable of detecting presence of a bound nucleic acid. 

In detail, a compartmentalized kit includes any kit in which reagents are contained in 
separate containers. Such containers include small glass containers, plastic containers, strips of 
5 plastic, glass or paper, or arraying material such as silica. Such containers allows one to 
efficiently transfer reagents from one compartment to another compartment such that the 
samples and reagents are not cross-contaminated, and the agents or solutions of each container 
can be added in a quantitative fashion from one compartment to another. Such containers will 
include a container which will accept the test sample, a container which contains the nucleic acid 

1 0 probe, containers which contain wash reagents (such as phosphate buffered saline, Tris-buffers, 
etc.), and containers which contain the reagents used to detect the bound probe. One skilled in 
the art will readily recognize that the previously unidentified transporter gene of the present 
invention can.be routinely identified using the sequence information disclosed herein can be 
readily incorporated into one of the established kit formats which are well known in the art, 

1 5 particularly expression arrays. 

Vectors/host cells 

The invention also provides vectors containing the nucleic acid molecules described herein. 
The term "vector" refers to a vehicle, preferably a nucleic acid molecule, which can transport the 
20 nucleic acid molecules. When the vector is a nucleic acid molecule, the nucleic acid molecules are 
covalently linked to the vector nucleic acid. With this aspect of the invention, the vector includes a 
plasmid, single or double stranded phage, a single or double stranded RNA or DNA viral vector, or 
artificial chromosome, such as a BAC, PAC, YAC, OR MAC. 

A vector can be maintained in the host cell as an extrachromosomal element where it 
25 replicates and produces additional copies of the nucleic acid molecules. Alternatively, the vector 

may integrate into the host cell genome and produce additional copies of the nucleic acid molecules 
when the host cell replicates. 

The invention provides vectors for the maintenance (cloning vectors) or vectors for 
expression (expression vectors) of the nucleic acid molecules. The vectors can function in 
30 procaryotic or eukaryotic cells or in both (shuttle vectors). 

Expression vectors contain cis-acting regulatory regions that are operably linked in the 
vector to the nucleic acid molecules such that transcription of the nucleic acid molecules is allowed 
in a host cell. The nucleic acid molecules can be introduced into the host cell with a separate 
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nucleic acid molecule capable of affecting transcription. Thus, the second nucleic acid molecule 
may provide a trans-acting factor interacting with the cis-regulatory control region to allow 
transcription of the nucleic acid molecules from the vector. Alternatively, a trans-acting factor may 
be supplied by the host cell. Finally, a trans-acting factor can be produced from the vector itself. It 
5 is understood, however, that in some embodiments, transcription and/or translation of the nucleic 
acid molecules can occur in a cell-free system. 

The regulatory sequence to which the nucleic acid molecules described herein can be 
operably linked include promoters for directing mRNA transcription. These include, but are not 
limited to, the left promoter from bacteriophage X 9 the lac, TRP, and TAC promoters from E, coli, 
1 0 the early and late promoters from S V40, the CMV immediate early promoter, the adenovirus early 
and late promoters, and retrovirus long-terminal repeats. 

In addition to control regions that promote transcription, expression vectors may also 
include regions that modulate transcription, such as repressor binding sites and enhancers. 
Examples include the SV40 enhancer, the cytomegalovirus immediate early enhancer, polyoma 
1 5 enhancer, adenovirus enhancers, and retrovirus LTR enhancers. 

In addition to containing sites for transcription initiation and control, expression vectors can 
also contain sequences necessary for transcription termination and, in the transcribed region a 
ribosome binding site for translation. Other regulatory control elements for expression include 
initiation and termination codons as well as polyadenylation signals. The person of ordinary skill in 
20 the art would be aware of the numerous regulatory sequences that are useful in expression vectors. 
Such regulatory sequences are described, for example, in Sambrook et al, Molecular Cloning: A 
Laboratory Manual. 2nd. ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 
(1989). 

A variety of expression vectors can be used to express a nucleic acid molecule. Such 
25 vectors include chromosomal, episomal, and virus-derived vectors, for example vectors derived 
from bacterial plasmids, from bacteriophage, from yeast episomes, from yeast chromosomal 
elements, including yeast artificial chromosomes, from viruses such as baculoviruses, 
papovaviruses such as SV40, Vaccinia viruses, adenoviruses, poxviruses, pseudorabies viruses, and 
retroviruses. Vectors may also be derived from combinations of these sources such as those derived 
30 from plasmid and bacteriophage genetic elements, e.g. cosmids and phagemids. Appropriate 

cloning and expression vectors for prokaryotic and eukaryotic hosts are described in Sambrook et 
aL, Molecular Cloning: A Laboratory Manual. 2nd. ed., Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, NY, (1989). 

48 

JSDOCID: <WO_0233086A2_I_> 



WO 02/33086 



PCT/US01/32152 



The regulatory sequence may provide constitutive expression in one or more host cells (i.e. 
tissue specific) or may provide for inducible expression in one or more cell types such as by 
temperature, nutrient additive, or exogenous factor such as a hormone or other ligand. A variety of 
vectors providing for constitutive and inducible expression in prokaryotic and eukaryotic hosts are 
5 well known to those of ordinary skill in the art. 

The nucleic acid molecules can be inserted into the vector nucleic acid by well-known 
methodology. Generally, the DNA sequence that will ultimately be expressed is joined to an 
expression vector by cleaving the DNA sequence and the expression vector with one or more 
restriction enzymes and then ligating the fragments together. Procedures for restriction enzyme 

10 digestion and ligation arc well known to those of ordinary skill in the art. 

The vector containing the appropriate nucleic acid molecule can be introduced into an 
appropriate host cell for propagation or expression using well-known techniques. Bacterial cells 
include, but are not limited to. E colt, Streptomyces, and Salmonella typhimurinm. Eukaryotic cells 
include, but are not limited to. yeast, insect cells such as Drosophila, animal cells such as COS and 

15 CHO cells, and plant cells. 

As described herein, it may be desirable to express the peptide as a fusion protein. 
Accordingly, the invention provides fusion vectors that allow for the production of the peptides. 
Fusion vectors can increase the expression of a recombinant protein, increase the solubility of the 
recombinant protein, and aid in the purification of the protein by acting for example as a ligand for 

20 affinity purification. A proteolytic cleavage site may be introduced at the junction of the fusion 
moiety so that the desired peptide can ultimately be separated from the fusion moiety. Proteolytic 
enzymes include, but are not limited to, factor Xa, thrombin, and enterotransporter. Typical fusion 
expression vectors include pGEX (Smith et al, Gene 57:31-40 (1988)), pMAL (New England 
Biolabs, Beverly, MA) and pRJT5 (Pharmacia, Piscataway, NJ) which fuse glutathione S- 

25 transferase (GST), maltose E binding protein, or protein A, respectively, to the target recombinant 
protein. Examples of suitable inducible non-fusion E. coli expression vectors include pTrc (Amann 
et al, Gene 69:301-3 15 (19S8)) and pET 1 Id (Studier et aL, Gene Expression Technology: Methods 
in Enzymology 755:60-89 (1990)). 

Recombinant protein expression can be maximized in host bacteria by providing a genetic 

30 background wherein the host cell has an impaired capacity to proteolytically cleave the recombinant 
protein. (Gottesman, S., Gene Expression Technology: Methods in Enzymology 185, Academic 
Press, San Diego, California ( 1 990) 119-1 28). Alternatively, the sequence of the nucleic acid 
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molecule of interest can be altered to provide preferential codon usage for a specific host cell, for 
example E. coll (Wada et al , Nucleic Acids Res, 20:2 111-2118(1 992)). 

The nucleic acid molecules can also be expressed by expression vectors that are operative in 
yeast. Examples of vectors for expression in yeast e.g., S. cerevisiae include pYepSecl (Baldari, et 
5 aUEMBOJ. (5:229-234 (1987)), pMFa (Kurjan et al, Cell 30:933-943(1982)), pJRY88 (Schultze/ 
al , Gene 54: 1 1 3- 1 23 ( 1 987)), and p YES2 (Invitrogen Corporation, San Diego, CA). 

The nucleic acid molecules can also be expressed in insect cells using, for example, 
baculovirus expression vectors. Baculovirus vectors available for expression of proteins in cultured 
insect cells (e.g., Sf 9 cells) include the pAc series (Smith et al, Mol Cell Biol i:2156-2165 
1 0 (1 983)) and the pVL series (Lucklow et al , Virology 1 70:3 1 -39 (1 989)). 

In certain embodiments of the invention, the nucleic acid molecules described herein are 
expressed in mammalian cells using mammalian expression vectors. Examples of mammalian 
expression vectors include pCDM8 (Seed, B. Nature 329: 840(1 987)) and pMT2PC (Kaufinan et al, 
EMBOJ. (5:187-195 (1987)). 
1 5 The expression vectors listed herein are provided by way of example only of the well- 

known vectors available to those of ordinary skill in the art that would be useful to express the 
nucleic acid molecules. The person of ordinary skill in the art would be aware of other vectors 
suitable for maintenance propagation or expression of the nucleic acid molecules described herein. 
These are found for example in Sambrook, J., Fritsh, E. F., and Maniatis, T. Molecular Cloning: A 
20 Laboratory Manual 2nd, ed, Cold Spring Harbor Laboratory, Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, NY, 1 989. 

The invention also encompasses vectors in which the nucleic acid sequences described 
herein are cloned into the vector in reverse orientation, but operably linked to a regulatory sequence 
that permits transcription of antisense RNA. Thus, an antisense transcript can be produced to all, or 
25 to a portion, of the nucleic acid molecule sequences described herein, including both coding and 
non-coding regions. Expression of this antisense RNA is subject to each of the parameters 
described above in relation to expression of the sense RNA (regulatory sequences, constitutive or 
inducible expression, tissue-specific expression). 

The invention also relates to recombinant host cells containing the vectors described herein. 
30 Host cells therefore include prokaryotic cells, lower eukaryotic cells such as yeast, other eukaryotic 
cells such as insect cells, and higher eukaryotic cells such as mammalian cells. 

The recombinant host cells are prepared by introducing the vector constructs described 
herein into the cells by techniques readily available to the person of ordinary skill in the art. These 
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include, but are not limited to, calcium phosphate transfection, DEAE-dextran-mediated 
transfection, cationic lipid-mediated transfection, electroporation, transduction, infection, 
lipofection, and other techniques such as those found in Sambrook, et al {Molecular Cloning: A 
Laboratory Manual 2nd, ed, Cold Spring Harbor Labor atoiy, Cold Spring Harbor Laboratory 
5 Press, Cold Spring Harbor, NY, 1 989). 

Host cells can contain more than one vector. Thus, different nucleotide sequences can be 
introduced on different vectors of the same cell. Similarly, the nucleic acid molecules can be 
introduced either alone or with other nucleic acid molecules that are hot related to the nucleic acid 
molecules such as those providing trans-acting factors for expression vectors. When more than one 

10 vector is introduced into a cell, the vectors can be introduced independently, co-introduced or joined 
to the nucleic acid molecule vector. 

In the case of bacteriophage and viral vectors, these can be introduced into cells as packaged 
or encapsulated virus by standard procedures for infection and transduction. Viral vectors can be 
replication-competent or replication-defective. In the case in which viral replication is defective, 

1 5 replication will occur in host cells providing functions that complement the defects. 

Vectors generally include selectable markers that enable the selection of the subpopulation 
of cells that contain the recombinant vector constructs. The marker can be contained in the same 
vector that contains the nucleic acid molecules described herein or may be on a separate vector. 
Markers include tetracycline or ampicillin-resistance genes for prokaryotic host cells and 

20 dihydrofolate reductase or neomycin resistance for eukaryotic host cells. However, any marker that 
provides selection for a phenotypic trait will be effective. 

While the mature proteins can be produced in bacteria, yeast, mammalian cells, and other 
cells under the control of the appropriate regulatory sequences, cell- free transcription and 
translation systems can also be used to produce these proteins using RNA derived from the DNA 

25 constructs described herein. 

Where secretion of the peptide is desired, which is. difficult to achieve with multi- 
transmembrane domain containing proteins such as transporters, appropriate secretion signals are 
incorporated into the vector. The signal sequence can be endogenous to the peptides or 
heterologous to these peptides. 

30 Where the peptide is not secreted into the medium, which is typically the case with 

transporters, the protein can be isolated from the host cell by standard disruption procedures, 
including freeze thaw, sonication, mechanical disruption, use of lysing agents and the like. The 
peptide can then be recovered and purified by well-known purification methods including 
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ammonium sulfate precipitation, acid. extraction, anion or canonic exchange chromatography, 
phosphocellulose chromatography, hydrophobic-interaction chromatography, affinity 
chromatography, hydroxylapatite chromatography, lectin chromatography, or high performance 
liquid chromatography. 

It is also understood that depending upon the host cell in recombinant production of the 
peptides described herein, the peptides can have various glycosylation patterns, depending upon the 
cell, or maybe non-glycosylated as when produced in bacteria. In addition, the peptides may 
include an initial modified methionine in some cases as a result of a host-mediated process. 

Uses of vectors and host cells 

The recombinant host cells expressing the peptides described herein have a variety of use's. 
First, the cells are useful for producing a transporter protein or peptide that can be further purified to 
produce desired amounts of transporter protein or fragments. Thus, host cells containing expression 
vectors are useful for peptide production. 

Host cells are also useful for conducting cell-based assays involving the transporter protein 
or transporter protein fragments, such as those described above as well as other formats known in 
the art. Thus, a recombinant host cell expressing a native transporter protein is useful for assaying 
compounds that stimulate or inhibit transporter protein function. 

Host cells arc also useful for identifying transporter protein mutants in which these functions 
are affected. If the mutants naturally occur and give rise to a pathology, host cells containing the 
mutations are useful to assay compounds that have a desired effect on the mutant transporter protein 
(for example, stimulating or inhibiting function) which may not be indicated by their effect on the 
native transporter protein. 

Genetically engineered host cells can be further used to produce non-human transgenic 
animals. A transgenic animal is preferably a mammal, for example a rodent, such as a rat or mouse, 
in which one or more of the cells of the animal include a transgene. A transgene is exogenous DNA 
that is integrated into the genome of a cell from which a transgenic animal develops and which 
remains in the genome of the mature animal in one or more cell types or tissues of the transgenic 
animal. These animals are useful for studying the function of a transporter protein and identifying 
and evaluating modulators of transporter protein activity. Other examples of transgenic animals 
include non-human primates, sheep, dogs, cows, goats, chickens, and amphibians. 

A transgenic animal can be produced by introducing nucleic acid into the male pronuclei of 
a fertilized oocyte, e.g., by microinjection, retroviral infection, and allowing the oocyte to develop 
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in a pseudopregnant female foster animal. Any of the transporter protein nucleotide sequences can 
be introduced as a transgene into the genome of a non-human animal, such as a mouse. 

Any of the regulatory or other sequences useful in expression vectors can form part of the 
transgenic sequence. This includes intronic sequences and polyadenylation signals, if not already 
5 included. A tissue-specific regulatory sequence(s) can be operably linked to the transgene to direct 
expression of the transporter protein to particular cells. 

Methods for generating transgenic animals via embryo manipulation and microinjection, 
particularly animals such as mice, have become conventional in the art and are described, for 
example, in U.S. Patent Nos. 4,736,866 and 4,870,009, both by Leder et al, U.S. Patent No. 

1 0 4.S73. 1 9 1 by Wagner et al and in Hogan, B., Manipulating the Mouse Embryo, (Cold Spring 
Harbor Laboratory Press, Cold Spring Harbor, N. Y., 1 986). Similar methods are used for 
production of other transgenic animals. A transgenic founder animal can be identified based upon 
the presence of the transgene in its genome and/or expression of transgenic mRNA in tissues or 
cells of the animals. A transgenic founder animal can then be used to breed additional animals 

1 5 carrying the transgene. Moreover, transgenic animals carrying a transgene can further be bred to 
other transgenic animals carrying other transgenes. A transgenic animal also includes animals in 
which the entire animal or tissues in the animal have been produced using the homologously 
recombinant host cells described herein. 

In another embodiment, transgenic non-human animals can be produced which contain 

20 selected systems that allow for regulated expression of the transgene. One example of such a 

system is the cre/loxP recombinase system of bacteriophage PI . For a description of the cre/loxP 
recombinase system, see, e.g., Lakso et al PNAS 89:6232-6236 (1992). Another example of a 
recombinase system is the FLP recombinase system of S. cerevisiae (O'Gorman et al. Science 
251 :1351-1355 (1991). If a cre/loxP recombinase system is used to regulate expression of the 

25 transgene, animals containing transgenes encoding both the Cre recombinase and a selected protein 
is required. Such animals can be provided through the construction of "double" transgenic animals, 
e.g., by mating two transgenic animals, one containing a transgene encoding a selected protein and 
the other containing a transgene encoding a recombinase. 

Clones of the non-human transgenic animals described herein can also be produced 

30 according to the methods described in Wilmut, I. et al Nature 355:810-813 (1997) and PCT 

International Publication Nos. WO 97/07668 and WO 97/07669. In brief, a cell, e.g., a somatic cell, 
from the transgenic animal can be isolated and induced to exit the growth cycle and enter G 0 phase. 
The quiescent cell can then be fused, e.g., through the use of electrical pulses, to an enucleated 
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oocyte from an animal of the same species from which the quiescent cell is isolated. The 
reconstructed oocyte is then cultured such that it develops to morula or blastocyst and then 
transferred to pseudopregnant female foster animal. The offspring bom of this female foster animal 
will be a clone of the animal from which the cell, e.g., the somatic cell, is isolated. . 
5 Transgenic animals containing recombinant cells that express the peptides described herein 

are useful to conduct the assays described herein in an in vivo context. Accordingly, the various 
physiological factors that are present in vivo and that could effect ligand binding, transporter protein 
activation, and signal transduction, may not be evident from in vitro cell-free or cell-based assays. 
Accordingly, it is useful to provide non-human transgenic animals to assay in vivo transporter 

1 0 protein function, including ligand interaction, the effect of specific mutant transporter proteins on 
transporter protein function and ligand interaction, and the effect of chimeric transporter proteins v It 
is also possible to assess the effect of null mutations, that is mutations that substantially or 
completely eliminate one or more transporter protein functions. 

All publications and patents mentioned in the above specification are herein incorporated 

15 by reference. Various modifications and variations of the described method and system of the 
invention will be apparent to those skilled in the art without departing from the scope and spirit 
of the invention. Although the invention has been described in connection with specific 
preferred embodiments, it should be understood that the invention as claimed should not be 
unduly limited to such specific embodiments. Indeed, various modifications of the above- 

20 described modes for carrying out the invention which are obvious to those skilled in the field of 
molecular biology or related fields are intended to be within the scope of the following claims. 
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Claims 

That which is claimed is: 

1 . An isolated peptide consisting of an amino acid sequence selected from the group 
consisting of: 

(a) an amino acid sequence shown in SEQ ED NO:2; 

(b) an amino acid sequence of an allelic variant of an amino acid sequence 
shown in SEQ ID NO:2, wherein said allelic variant is encoded by a nucleic acid molecule that 
hybridizes under stringent conditions to the opposite strand of a nucleic acid molecule shown in 
SEQIDNOS:! or 3; 

(c) an amino acid sequence of an ortholog of an amino acid sequence shown in 
SEQ ID NO:2, wherein said ortholog is encoded by a nucleic acid molecule that hybridizes under 
stringent conditions to the opposite strand of a nucleic acid molecule shown in SEQ ID NOS:l or 3; 
and 

(d) a fragment of an amino acid sequence shown in SEQ ID NO:2, wherein said 
fragment comprises at least 10 contiguous amino acids. 

2. An isolated peptide comprising an amino acid sequence selected from the group 
consisting of: 

(a) an amino acid sequence shown in SEQ ED NO:2; 

(b) an amino acid sequence of an allelic variant of an amino acid sequence 
shown in SEQ ED NO:2, wherein said allelic variant is encoded by a nucleic acid molecule that 
hybridizes under stringent conditions to the opposite strand of a nucleic acid molecule shown in 
SEQ IDNOS:lor3; 

(c) an amino acid sequence of an ortholog of an amino acid sequence shown in 
SEQ ID NO:2, wherein said ortholog is encoded by a nucleic acid molecule that hybridizes under 
stringent conditions to the opposite strand of a nucleic acid molecule shown in SEQ ID NOS:l or 3; 
and 

(d) a fragment of an amino acid sequence shown in SEQ ID NO:2, wherein said 
fragment comprises at least 10 contiguous amino acids. 
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3 . An isolated antibody that selectively binds to a peptide of claim 2. 

4. An isolated nucleic acid molecule consisting of a nucleotide sequence selected from 
the group consisting of: 

(a) a nucleotide sequence that encodes an amino acid sequence shown in SEQ 

ID NO:2; 

(b) a nucleotide sequence that encodes of an allelic variant of an amino acid 
sequence shown in SEQ ID NO:2, wherein said nucleotide sequence hybridizes under stringent 
conditions to the opposite strand of a nucleic acid molecule shown in SEQ ID NOSrl or 3; 

(c) a nucleotide sequence that encodes an ortholog of an amino acid sequence 
shown in SEQ ID NO:2, wherein said nucleotide sequence hybridizes under stringent conditions^ 
the opposite strand of a nucleic acid molecule shown in SEQ ID NOS:l or 3; 

(d) a nucleotide sequence that encodes a fragment of an amino acid sequence 
shown in SEQ ID NO:2, wherein said fragment comprises at least 10 contiguous amino acids; and 

(e) a nucleotide sequence that is the complement of a nucleotide sequence of 

(aHd). 

5. An isolated nucleic acid molecule comprising a nucleotide sequence selected from 
the group consisting of: 

(a) a nucleotide sequence that encodes an amino acid sequence shown in SEQ 

ID NO:2; 

(b) a nucleotide sequence that encodes of an allelic variant of an amino acid 
sequence shown in SEQ ID NO:2, wherein said nucleotide sequence hybridizes under stringent 
conditions to the opposite strand of a nucleic acid molecule shown in SEQ ID NOS:l or 3; 

(c) a nucleotide sequence that encodes an ortholog of an amino acid sequence 
shown in SEQ ID NO:2, wherein said nucleotide sequence hybridizes under stringent conditions to 
the opposite strand of a nucleic acid molecule shown in SEQ ID NOS:l or 3; 

(d) a nucleotide sequence that encodes a fragment of an amino acid sequence 
shown in SEQ ID NO:2 5 wherein said fragment comprises at least 10 contiguous amino acids; and 

(e) a nucleotide sequence that is the complement of a nucleotide sequence of 

(a)-(d). 
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6. A gene chip comprising a nucleic acid molecule of claim 5. 

7. A transgenic non-human animal comprising a nucleic acid molecule of claim 5. 

8. A nucleic acid vector comprising a nucleic acid molecule of claim 5. 

9. A host cell containing the vector of claim 8. 

10. A method for producing any of the peptides of claim 1 comprising introducing a 
nucleotide sequence encoding any of the amino acid sequences in (a)-(d) into a host cell, and 
culturing the host cell under conditions in which the peptides are expressed from the nucleotide , 
sequence. 

11. A method for producing any of the peptides of claim 2 comprising introducing a 
nucleotide sequence encoding any of the amino acid sequences in (a)-(d) into a host cell, and 
culturing the host cell under conditions in which the peptides are expressed from the nucleotide 
sequence. 

12. A method for detecting the presence of any of the peptides of claim 2 in a sample, 
said method comprising contacting said sample with a detection agent that specifically allows 
detection of the presence of the peptide in the sample and then detecting the presence of the peptide. 

13. A method for detecting the presence of a nucleic acid molecule of claim 5 in a 
sample, said method comprising contacting the sample with an oligonucleotide that hybridizes to 
said nucleic acid molecule under stringent conditions and determining whether the oligonucleotide 
binds to said nucleic acid molecule in the sample. 

14. A method for identifying a modulator of a peptide of claim 2, said method 
comprising contacting said peptide with an agent and determining if said agent has modulated the 
function or activity of said peptide. 
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15. The method of claim 1 4, wherein said agent is administered to a host cell comprising 
an expression vector that expresses said peptide. 

1 6. A method for identifying an agent that binds to any of the peptides of claim 2, said 
method comprising contacting the peptide with an agent and assaying the contacted mixture to 
determine whether a complex is formed with the agent bound to the peptide. 

1 7. A pharmaceutical composition comprising an agent identified by the method of 
claim 16 and a pharmaceutically acceptable carrier therefor. 

18. A method for treating a disease or condition mediated by a human transporter . ✓ 
protein, said method comprising administering to a patient a pharmaceutically effective amount of 
an agent identified by the method of claim 16. 

1 9. A method for identifying a modulator of the expression of a peptide of claim 2, said 
method comprising contacting a cell expressing said peptide with an agent, and determining if said 
agent has modulated the expression of said peptide. 

20. An isolated human transporter peptide having an amino acid sequence that shares at 
least 70% homology with an amino acid sequence shown in SEQ ID NO:2. 

21 . A peptide according to claim 20 that shares at least 90 percent homology with an 
amino acid sequence shown in SEQ ID NO:2. 

22. An isolated nucleic acid molecule encoding a human transporter peptide, said 
nucleic acid molecule sharing at least 80 percent homology with a nucleic acid molecule shown in 
SEQIDNOSil or 3. 

23. A nucleic acid molecule according to claim 22 that shares at least 90 percent 
homology with a nucleic acid molecule shown in SEQ ED NOS: 1 or 3. 
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1 GTCTCGTGTA TGGCGTGGTT AAGGTTGCAG CCTCTCACCT CTGCCTTCCT 
51 CCATTTTGGG CTGGTTACCT TTGTGCTCTT CCTGAATGGT CTTCGAGCAG 
101 AGGC TGGTGG CTCAGGGGAC GTGCCAAGCA CAGGGCAGAA CAATGAGTCC 
151 TGTTCAGGGT CATCGGACTG CAAGGAGGGT GTCATCCTGC CAATCTGGTA 
201 CCCGGAGAAC CCTTCCCTTG GGGACAAGAT TGCCAGGGTC ATTGTCTATT 
251 TTGTGGCCCT GATATACATG TTCCTTGGGG TGTCCATCAT TGCTGACCGC 
301 TTCATGGCAT CTATTGAAGT CATCACCTCT CAAGAGAGGG AGGTGACAAT 
351 TAAGAAACCC AATGGAGAAA CCAGCACAAC CACTATTCGG GTCTGGAATG 
401 AAACTGTCTC CAACCTGACC CTTATGGCCC TGGGTTCCTC TGCTCCTGAG 
4 51 ATACTCCTCT CTTTAATTGA GGTGTGTGGT CATGGGTTCA TTGCTGGTGA 
501 TCTGGGACCT TCTACCATTG TAGGGAGTGC AGCCTTCAAC ATGTTCATCA 
551 TCATTGGCAT CTGTGTCTAC GTGATCCCAG ACGGAGAGAC TCGCAAGATC 
601 AAGCATCTAC GAGTCTTCTT CATCACCGCT GCTTGGAGTA TCTTTGCCTA 
651 CATCTGGCTC TATATGATTC TGGCAGTCTT CTCCCCTGGT GTGGTCCAGG 
701 TTTGGGAAGG CCTCCTCACT CTCTTCTTCT TTCCAGTGTG TGTCCTTCTG 
751 GCCTGGGTGG CAGATAAACG ACTGCTCTTC TACAAATACA TGCACAAAAA 
801 GTACCGCACA GACAAACACC GAGGAATTAT CATAGAGACA GAGGGTGACC 
8 51 ACCCTAAGGG CATTGAGATG GATGGGAAAA TGAT GAATTC CCATTTTCTA 
901 GATGGGAACC TGGTGCCCCT GGAAGGGAAG GAAGTGGATG AGTCCCGCAG 
951 AGAGATGATC CGGATCCTCA AGGATCTGAA GCAAAAACAC CCAGAGAAGG 
1001 ACTTAGATCA GCTGGTGGAG ATGGCCAATT ACTATGCTCT TTCCCACCAA 
1051 CAGAAGAGCC GCGCCTTCTA CCGTATCCAA GCCACTCGTA TGATGACTGG 
1101 TGCAGGCAAT ATCCTGAAGA AACATGCAGC AGAACAAGCC AAGAAGGCCT 
1151 CCAGCATGAG CGAGGTGCAC ACCGATGAGC CTGAGGACTT TATTTCCAAG 
1201 GTCTTCTTTG ACCCATGTTC TTACCAGTGC CTGGAGAACT GTGGGGCTGT 
1251 ACTCCTGACA GTGGTGAGGA AAGGGGGAGA CATGTCAAAG ACCATGTATG 
1301 TGGACTACAA AACAGAGGAT GGTTCTGCCA ATGCAGGGGC TGACTATGAG 
1351 TTCACAGAGG GCACGGTGGT TCTGAAGCCA GGAGAGACCC AGAAGGAGTT 
14 01 CTCCGTGGGC AT AATTG AT G ACGACATTTT TGAGGAGGAT GAACACTTCT 
14 51 TTGTAAGGTT GAGCAATGTC CGCATAGAGG AGGAGCAGCC AGAGGAGGGG 
1501 ATGCCTCCAG CAATATTCAA CAGTCTTCCC TTGCCTCGGG CTGTCCTAGC 
1551 CTCCCCTTGT GTGGCCACAG TTACCATCTT GGATGATGAC CATGCAGGCA 
1601 TCTTCACTTT TGAATGTGAT ACTATTCATG TCAGTGAGAG TATTGGTGTT 
1651 ATGGAGGTCA AGGTTCTGCG GACATCAGGT GCCCGGGGTA CAGTCATCGT 
1701 CCCCTTTAGG ACAGTAGAAG GGACAGCCAA GGGTGGCGGT GAGGACTTTG 
17 51 AAGACACATA TGGGGAGTTG GAATTC AAG A ATGATGAAAC TGTGAAAACC 
1801 ATAAGGGTTA AAATAGTAGA TGAGGAGGAA TACGAAAGGC AAGAGAATTT 
1851 CTTCATTGCC CTTGGTGAAC CGAAATGGAT GGAACGTGGA ATATCAGATG 
1901 TG AC AG AC AG GAAGCTGACT AT GG AAG AAG AGGAGGCCAA GAGGATAGCA 
1951 GAGATGGGAA AGCCAGTATT GGGTGAACAC CCCAAACTGG AAGTCATCAT 
2001 TGAAGAGTCC TATGAGTTCA AGACTACGGT GGACAAACTG AT C AAG AAG A 
2051 CAAACCTGGC CTTGGTTGTG GGGACCCATT CCTGGAGGGA CCAGTTCATG 
2101 GAGGCCATCA CCGTCAGTGC AGCAGGGGAT GAGGATGAGG ATGAATCCGG 
2151 GGAGGAGAGG CTGCCCTCCT GCTTTGACTA CGTCATGCAC TTCCTGACTG 
2201 TCTTCTGGAA GGTGCTGTTT GCCTGTGTGC CCCCCACAGA GTACTGCCAC 
2251 GGCTGGGCCT GCTTCGCCGT CTCCATCCTC ATCATTGGCA TGCTCACCGC 
2 301 CATCATTGGG GACCTGGCCT CGCACTTCGG CTGCACCATT GGTCTCAAAG 
2351 ATTCGGTCAC AGCTGTTGTT TTCGTGGCAT TTGGCACCTC TGTCCCAGAT 
2401 ACGTTTGCCA GCAAAGCTGC TGCCCTCCAG GATGTATATG CAGACGCCTC 
2 4 51 CATTGGCAAC GTGACGGGCA GCAACGCCGT CAATGTCTTC CTGGGCATCG 
2 501 GCCTGGCCTG GTCCGTGGCC GCCATCTACT GGGCTCTGCA GGGACAGGAG 
2551 TTCCACGTGT CGGCCGGCAC ACTGGCCTTC TCCGTCACCC TCTTCACCAT 
2 601 CTTTGCATTT GTCTGCATCA GCGTGCTCTT GTACCGAAGG CGGCCGCACC 
2651 TGGGAGGGGA GCTTGGTGGC CCCCGTGGCT GCAAGCTCGC CACAACATGG 
2701 CTCTTTGTGA GCCTGTGGCT CCTCTACATA CTCTTTGCCA CACTAGAGGC 
27 51 CTATTGCTAC ATCAAGGGGT TCTAAGCCAC AC 
(SEQ ID NO: 1) 



5'UTR: 1-9 

Start Codon: 10 

Stop Codon: 2773 

3'UTR: 2776 
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HOMOLOGOUS PROTEINS: 
Top 10 BLAST Hits: 

Sequences producing significant alignments: 



CRAI 18000005047237 
CRA I 18000005200270 
CRAI 1000682343796 
CRA] 18000004939788 
CRA| 18000005028314 
CRA| 18000004968774 
CRA | 18000004882912 
CRAI 18000005218648 
CRA| 18000005218651 
CRAI 18000004907324 

dbEST: 



/altid=gi | 2498054 /def«sp | P70549 | NAC3_RAT SO. 
/altid=gi | 4140706 /def«gb | AAD04173 . 1 1 (AF107. 
/altid=gi | 64 53729 /def=gb | AAF08988 . 1 | AF108389 . 
/altid=gi 1 1083801 /def^pir 1 1 S43730 Na+/Ca2+-. 
/altid=gi 11279782 /def=gb | AAA97928 . 1 | (U5266. 
/altid=gi | 382752 /def=prf | | 1901175A Na/Ca ex. 
/altid=gi | 627801 /def =pir | I B53335 Na+/Ca2+-e. 
/altid=gi | 4 566522 /def =gb | AAD23386 . 1 1 AF10916 . 
/altid=gi | 4566528 /def=gb | AAD23389 . 1 | AF10916 . 
/ altid=gi | 479177 /def=pir | | S32435 Na+/Ca2+-e. 



Sequences producing significant alignments: 
gi 1 11600765 /dataset=dbest /taxon=96... 
gi I 318815 /dataset=dbest /taxon=9606 /... 



Score E 
(bits) Value 



1828 
1342 
1338 
1335 
1334 
1333 
1331 
1330 
1329 
1328 



0.0 
0.0 
0.0 
0.0 
0.0 
0.0 
0.0 
0.0 
0.0 
0.0 



Score E 

(bits) Value 

500 e-138 

216 2e-53 



EXPRESSION INFORMATION FOR MODULATORY USE: 

^•!^ifo?I 65 r. /?°i ed . (Brain ' Heart ' Kidney, Lung, Spleen, Testis, Leukocyte) 
go. 1 318815 Fetal brain J 

Tissue expression: 

Pooled tissues (Brain, Heart, Kidney, Lung, Spleen, Testis, Leukocyte) 
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1 MAWLRLQPLT SAFLHFGLVT FVLFLNGLRA EAGGSGDVPS TGQNNESCSG 
51 SSDCKEGVIL PIWYPENPSL GDKIARVIVY FVALIYMFLG VSIIADRFMA 
101 SIEVITSQER EVTIKKPNGE TSTTTIRVWN ETVSNLTLMA LGSSAPEILL 
151 SLI EVCGHGF IAGDLGPSTI VGSAAFNMFI IIGICVYVIP DGETRKIKHL 
201 RVFFITAAWS IFAYIWLYMI LAVFSPGVVQ VWEGLLTLFF FPVCVLLAWV 
251 ADKRLLFYKY MHKKYRTDKH RGIIIETEGD HPKGIEMDGK MMNSHFLDGN 
301 LVPLEGKEVD ESRREMIRIL KDLKQKHPEK DLDQLVEMAN YYALSHQQKS 
351 RAFYRIQATR MMTGAGNILK KHAAEQAKKA SSMSEVHTDE PEDFISKVFF 
401 DPCSYQCLEN CGAVLLTVVR KGGDMSKTMY VDYKTEDGSA NAGADYEFTE 
451 GTVVLKPGET QKEFSVGI ID DDIFEEDEHF FVRLSNVRIE EEQPEEGMPP 
501 AIFNSLPLPR AVLASPCVAT VTILDDDHAG IFTFECDTIH VSESIGVMEV 
551 KVLRTSGARG TVIVPFRTVE GTAKGGGEDF EDTYGELEFK NDETVKTIRV 
601 KIVDEEEYER OENFFIALGE PKWMERGISD VTDRKLTMEE EEAKRIAEMG 
651 KPVLGEHPKL EVITEESYEF KTTVDKLIKK TNLALWGTH SWRDQFMEAI 
701 TVSAAGDEDE DESGEERLPS CFDYVMHFLT VFWKVLFACV PPTEYCHGWA 
751 CFAVSILIIG MLTAI IGDLA SHFGCTIGLK DSVTAWFVA FGTSVPDTFA 
801 SKAAALQDVY ADASIGNVTG SNAVNVFLGI GLAWSVAAIY WALQGQEFHV 
851 SAGTLAFSVT LFTI FAFVCI SVLLYRRRPH LGGELGGPRG C KLAT TWL FV 
901 SLWLLYILFA TLEAYCYIKG F 
(SEQ ID NO:2) 

FEATURES: 

Functional domains and k*y rtgions : 

[l) pdocooooi psotoo: acs jlvcdsylation 
N-glycosylation 

Number of matches: 4 

1 45-49 NF.SC 

2 130-131 Nrr. 

3 135-13? ti UTL 

4 817-820 WVTC 



[2] PDOC00004 PS00cC4 CTA*i F'MOT. TM" SITE 

cAMP- and cGMP-dep*r.vi* r.x }.:ct?ir. Ur.dse phosphorylation site 

Number of matches : ? 

1 378-381 KlJ<L 

2 634-637 P.K LT 



(31 PDOC00005 PS0000!> I* K C _ I* M F HC _ 5 1 TE 
Protein kinase C phospi*'--: ,1 uiion site 

Number of matches: 11 



1 


113- 


115 


TIK 


2 


125- 


127 


TI* 


3 


597- 


•599 


TIF 


4 


194- 


•196 


TRK 


5 


267-269 


TDK 


6 


312-314 


SRR 


7 


460-462 


TQK 


8 


572- 


•57 4 


TAK 


9 


594-596 


TVK 


10 


125-127 


TIR 


11 


597- 


-599 


TIR 
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[4 J PDOC00006 PS00006 CK2_PHOSPHO_SITE 
Casein kinase II phosphorylation site 



Number of 


matches : 


16 


1 


69-72 


SLGD 


2 


106-109 


TSQE 


3 


144-147 


SAPE 


4 


151-154 


SLIE 


5 


277-280 


TEGD 


6 


312-315 


SRRB 


7 


382-385 


SMSE 


8 


460-463 


TQKE 


9 


522-525 


TILD 


10 


583-586 


TYGE 


11 


637-640 


TMEE 


12 


672-675 


TTVD 


13 


691-694 


SWRD 


14 


713-716 


SGEE 


15 


720-723 


SCFD 


16 


794-797 


SVPD 



(5 J PDOC00007 PS00007 TYR__PHOSPHO_SITE 
Tyrosine kinase phosphorylation site 

Number of matches: 2 

1 397-405 KVFFDPCSY 

2 601-608 KIVDEEEY 



[6] PDOC00008 PS00008 MYRISTYL 
N-myristoylation site 



Number of 


matches : 


15 


1 


50-55 


GSSDCK 


*> 


422-427 


GGDMSK 


3 


438-443 


G SAN AG 


4 


497-502 


GMPPAI 


5 


557-562 


GARGTV 


6 


571-576 


GTAKGG 


7 


760-765 


GMLTAI 


8 


774-779 


GCTIGL 


9 


778-783 


GLKDSV 


10 


816-821 


GNVTGS 


11 


829-834 


GIGLAW 


12 


831-836 


GLAWSV 


13 


882-887 


GGELGG 


14 


886-891 


GGPRGC 


15 


890-895 


GCKLAT 



Membrane spanning structure and domai ns: 
Helix Begin 



1 
2 
3 
4 
5 
6 
7 
8 
9 
10 
11 
12 
13 



8 
76 
133 
169 
206 
231 
505 
723 
747 
781 
823 
854 
893 



End 
28 
96 
153 
189 
226 
251 
525 
743 
7 67 
801 
843 
874 
913 



Score Certainity 
1.905 Certain 
2.032 Certain 
1.009 Certain 
1.943 Certain 
2.118 Certain 
2.072 Certain 
0.666 Putative 
1.298 Certain 
2.258 Certain 
1.232 Certain 
1.793 Certain 
2.424 Certain 
2.138 Certain 
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SDCC1D: <WO_0233086A2_L> 



WO 02/33086 PCT/USO 1/32 152 

BLAST Alignment to Top Hit: 

>CRA| 18000005047237 /altid-gi I 2498054 /def=sp| P7054 9 1 NAC3_RAT 

SODIUM /CALCIUM EXCHANGER 3 PRECURSOR ( NA+ /CA2+- EXCHANGE 
PROTEIN 3) /org=NA+/CA2+-EXCHANGE PROTEIN 3 /dataset^nraa 
/length=927 
Length = 927 

Score =* 1828 bits (4 682), Expect =0.0 

Identities = 897/927 (96%), Positives ~ 911/927 (97%), Gaps = 6/927 (0%) 
Frame » +1 

Query: 10 MAWLRLQPLTSAFLHFGLVTFVLFLNGLRAEAGGSGDVPSTGQNNESCSGSSDCKEGVIL 189 

MAWLRLQPLTSAFLHFGLVTFVLFLNGLRAEAG DVPS GQNNESCSGSSDCKEGVIL 
Sbjct: 1 MAWLRLQPLTSAFLHFGLVTFVLFLNGLRAEAGDLRDVPSAGQNNESCSGSSDCKEGVIL 60 

Query: 190 PIWYPENPSLGDKIARVIVYFVALI YMFLGVSIIADRFMASIEVITSQEREVTIKKPNGE 369 

PIWYPENPSLGDKIARVIVYFVALIYMFLGVSIIADRFMASIEVITSQEREVTIKKPNGE 
Sbjct ; 61 PIWYPENPSLGDKIARVI VYFVALI YMFLGVSIIADRFMASIEVITSQEREVTIKKPNGE 120 

Query: 370 TSTTTI RVWNETV SNLTLMALGSSAPEI LLSLI EVCGHGFI AGDLGPSTI VGSAAFNMFI 54 9 

TSTTTIRVWNETVSNLTLMALGSSAPEILLSLIEVCGHGFIAGDLGPSTIVGSAAFNMFI 
Sbjct: 121 TSTTTI RVWNETVSNLTLMALGSSAPEI LLSLI EVCGHGFI AGDLGPSTI VGSAAFNMFI 180 

Query: 550 IIGICVYVIPDGETRKIKHLRVFFITAAWSIFAYIWLYMILAVFSPGVVQVWEGLLTLFF 729 

IIGICVYVIPDGETRKIKHLRVFF+TAAWS+FAYIWLYMILAVFSPGVVQVWEGLLTLFF 
Sbjct: 181 IIGICVYVIPDGETRKIKHLRVFFVTAAWSVFAYIWLYMILAVFSPGVVQVWEGLLTLFF 240 

Query: 730 FPVCVLLAWVADKRLLFYKYMHKKYRTDKHRGIIIETEGDHPKGIEMDGKMMNSHFLDGN 909 

FP VC VLL AW V A D K RL LFYKYMHK+Y RT DKHRG III ET EG+ H P KG I EMDG KMMN S H FL DGN 
Sbjct: 241 FPVCVLLAW VADKRLLFYKYMHKRYRT DKHRGI 1 1 ETEGEHPKGI EMDGKMMN SHFLDGN 300 

Query: 910 LVPLEGKEVDESRREMIRILKDLKQKHPEKDLOQLVEMANYYALSHQQKSRAFYRIQATR 1089 

L+PLEGKEVDESRREMIRILKDLKQKHPEKDLDQLVEMANYYALSHQQKSRAFYRIQATR 
Sbjct: 301 LIP LEG KE V DE S RREM I RI LK DLKQKH P EK DLDQLV EMAN Y Y ALS HQQKS RAFY RI QAT R 360 

Query: 1090 MMTGAGNILKKHAAEQAKKASSMSEVHTDEPEDFISKVFFDPCSYQCLENCGAVLLTWR 1269 

MMTGAGNILKKHAAEQAKK +SMSEVHTDEPEDF SKVFFDPCSYQCLENCGAVLLTWR 
Sbjct: 361 MMTGAGNILKKHAAEQAKKTASMSEVHTDEPEDFASKVFFDPCSYQCLENCGAVLLTVVR 420 

Query: 1270 KGGDMSKTMYVDYKTEDGSANAGADYEFTEGTVVLKPGETQKEFSVGIIDDDIFEEDEHF 1449 

KGGD+SKTMYVDYKTEDGSANAGADYEFTEGTWLKPGETQKEFSVGIIDDDIFEEDEHF 
Sbjct: 421 KGGDISKTMYVDYKTEDGSANAGADYEFTEGTVVLKPGETQKEFSVGIIDDDI FEEDEHF 480 

Query: 1450 FVRLSNVRIEEEQPEEGMPPAIFNSLPLPRAVLASPCVATVTILDDDHAGIFTFECDTIH 1629 

FVRLSNVR+EEEQ EEGM PAI NSLPLPRAVLASPCVATVTILDDDHAGIFTFECDTIH 
Sbjct: 4 81 FVRLSNVRVEEEQLEEGMTPAILNSLPLPRAVLASPCVATVTILDDDHAGIFTFECDTIH 540 

Query: 1630 VSESIGVMEVKVLRTSGARGTVIVPFRTVEGTAKGGGEDFEDTYGELEFKNDETVKTIRV 1809 

VSESIGVMEVKVLRTSGARGTVIVPFRTVEGTAKGGGEDFEDTYGELEFKNDETVKTIRV 
Sbjct: 541 VSESIGVMEVKVLRTSGARGTVIVPFRTVEGTAKGGGEDFEDTYGELEFKNDETVKTIRV 600 

Query: 1810 KIVDEEEYERQENFFIALGEPKWMERGIS D VT DRKLTME E EEAKRI AEMGK P VL 1971 

KIVDEEEYERQENFFIALGEPKWMERGIS +VTDRKLTMEEEEAKRIAEMGKPVL 
Sbjct: 601 KIVDEEEYERQENFFIALGEPKWMERGISALLLSPEVTDRKLTMEEEEAKRIAEMGKPVL 660 

Query: 1972 GEHPKLEVIIEESYEFKTTVDKLIKKTNLALVVGTHSWRDQFMEAITVSAAGDEDEDESG 2151 

GEHPKLEVIIEESYEFK+TVDKLIKKTNLALVVGTHSWRDQFMEAITVSAAGDE+EDESG 
Sbjct: 661 GEHPKLEVIIEESYEFKSTVDKLIKKTNLALVVGTHSWRDQFMEAITVSAAGDEEEDESG 720 

Query: 2152 EERLPSCFDYVMHFLTVFWKVLFACVPPTEYCHGWACFAVSILIIGMLTAIIGDLASHFG 2331 

EERLPSCFDYVMHFLTVFWKVLFACVPPTEYCHGWACF VSI LI IGMLTAI IGDLASHFG 
Sbjct: 721 EERLP S C FD YVMH FLT V FW KV L F AC V P PT EY C HGW AC FW S I LI I GMLT AI I G DL AS H FG 780 

Query: 2332 CTIGLKDSVTAVVFVAFGTSVPDTFASKAAALQDVYADASIGNVTGSNAVNVFLGIGLAW 2511 

CTI GLKDS VTA V VFVA FGTS V P DT FAS KAAALQDV Y ADAS I GNVTGS NAVNVFLG I GLAW 
Sbjct: 781 CT I GLKD S VT A W FV AFGT S V P DT FAS KAAALQDV Y AD A S I GN VT G SN AVN V FLG I GLAW 840 

Query: 2512 S V AAI YW ALQGQE FH V S AGT LA F S VTL FT I FA FVC I S VLL Y RRRPH LGGELGG P RGC KLA 2691 

SVAAIYWA+QGQEFHVSAGTLAFSVTLFTIFAFVC+SVLLYRRRPHLGGELGGPRGCKLA 
Sbjct: 841 SVAAIYWAMQGQEFHVSAGTLAFSVTLFTIFAFVCLSVLLYRRRPHLGGELGGPRGCKLA 900 

Query: 2692 TTWLFVSLWLLYI LFATLEA YCYI KGF 2772 

TTWLFVSLWLLY+LFATLEAYCYIKGF 
Sbjct: 901 TTWLFVSLWLLYVLFATLEAYCYIKGF 927 (SEQ ID NO: 4) 



Homer search results (Pf am) 
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Scores for sequence family classification (score 
Model "Description 



PF01699 Sodium/calcium exchanger protein 
PF00324 Amino acid permease 
PF01971 Protein of unknown function 

Parsed for domains: 



Model 


Domain 


seq-f 


seq-t 


h.mm— f 


hmm-t 




PF01699 


1/2 


118 


257 . 


12 


152 


■ 3 


PF01971 


1/1 


644 


670 . 


193 


222 


PF00324 


1/1 


851 


877 . 


472 


4 98 




PF01699 


2/2 


757 


905 . 


1 


152 


n 



PCTYUS01/32152 



includes all domains) : 

Score E-value N 



294.6 1.2e-84 2 
2.8 5.9 1 

2.7 8.7 1 



score E-value 

121.3 1.8e-32 

2.7 8.7 

2.8 5.9 

181.4 1.5e-50 



MSDOCIO: <WO 0233086A2_I_> 
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1 TTGGATGAGA TCTAAAGCAT TATTAAGAGT GGGGAGTGCA AAGAAGAAAC 
51 CCTCATTTCA AAGATGAATG AGAATAATGG CATGTACAAA GGTCCTGGGG 
101 TGGACAGTCA CTTGGTATAA TCCAAGAGTG AACCTGAAGG CTATTGTTGT 
151 TGAAATGTAA TAAGGGAGAG AGTGACGGGA TGAAGGGGGA TGAGTGGGAA 
201 GCAGTGAATT CCTGCAAGGC TTTGAAGGTC ATGGGAAAGA ATTTGGTCTT 
251 TATATCAAGA GCAAGAGAAG ACTACTAAAG GGCTTCAAAC AGGGGAGCGA 
301 TATGCTTAAG TCTGTTTGTT TGTTTTTTTA AAAAAAGATT ACGGTGGCTA 
351 TATGAGGAAA GTGGAATTGA GAACTAGCGA GAGTTGGAGT GGTGAGCTCC 
401 ATTAGGAGGC TACTGAAGTA GATTCATGAG GTAAGGAGTG ATGGTGGCCT 
4j1 GGGCTGGGAT GATGGTGGTA GAAATGGAGA AAGAGTTGAT AGGATTTAGT 
501 GATTGGATAA GGGACAGAAG AGAGATGAAG GCTTTCAGAC TAACATCTGC 
551 TTTCTAACAT GAGTAACTGG GTGGCTGAAG ATGCTATTTT CTGAGCTGGG 
601 AAACAGGAGA AAAAGGAGCA AATATGGGGG ATGAAGACTT TGAGTCTTTA 
651 AGGTGCTGTA CAAACACAAA TCAGCATTCC TTTATTACTA AGGGTATCCC 
"7 01 ACACAGTTGT AGCAGAGGGA GAAAGATCGC CCCCCCCCCA CTTTTTTTTT 
7 51 TTTTTTAGCT ATTCCATGGT ATTTTCATTC TCATCCCACC CAAATGAGGC 
801 AGTGAGTGGT AAGATGAGTA TATAATAGTT TCAATTGCAT TTCATCCCAT 
851 TCTTCTGAGC TCAAGCTCAC CTTTTAGTGG TTTGAGGCCA GTAGATGAAG 
5*01 CTGCATATCA CCCCCAAAAT CTTGTCTCTA GTTTAACAAA ACTTATTTGA 
951 CAGACATTTG CATGTTTTAT TAATAATGAT TTTTACCACT TGTTCCTTTC 
1001 CA7CTTTGGG TTTGAAATTT GAGTGGCTGG CGGATGATCA TCTTCCTGTT 
1051 ACTGCCTGCT TAAACTGCTC ATAAGCAGGT TTTACTGGAG GGCTCAGAGC 
1101 TGCTGTGAAC TTGGTCTTGG GCACAACTTA CATGGCCTCT GTTTGGCTAT 
1151 GGGCTGGGTG GCATTCACCA TTTATCAACT CTTTTGATTT CCCAAGCTAT 
1201 CTCA3AATTA TAGCTTGCCT CCAGAAGTCT TGCATTCGGG GAGGAAGTTT 
12 51 CTTTCCAAGG GAGCTCAGTT TTCAAGGTTT ATTGCTCTGT TTAATGGATG 
1301 AGATCTAAAG CATTATTAAG AGTGGGGAGT GCAAAGAAGA AACACTCATT 
1351 TC AAAATCG A TTGAGAATAA TGGCATGTAC AAAGGTCCTG GGGTGGACAG 
14 01 TCACTTGGTA TAATCCTGGA GTGAACATGA AGGCCAAGGA AATATGTATA 
14 51 CATTAAACAG AGCAAGGTTT TCAATTTTCT GGGG AC TAGT CCATGAAAAT 
1501 TCAATTCAAT ATACTCTCTT GCAAACCTAT GTTATCCAAG ATACTCAAGT 
1551 ATAATGACAA CAGGGTAAGG AAGTCCGAAC AC CCC AG AAA CAGTATAAAT 
1601 GGGCATGAAG ATTCAGGTTA TACATGGCCT ATTTTAAGTT GCTTCTTGAG 
1651 AACTCTCACA GGTAATACCA GTTTGGGAGA CAGGACTTGA AGGCTATTGC 
1701 TGCATTTCCA TCCCCAGTAT TCCCAGCTAT TTCAAGCCA? TTTTCAACGG 
1751 AGTCTCCACC AGATGGTTTG GAGGACAGAG CAGCTATTTG TGCCTCCCAT 
1801 TGACATCTAT TTTTCCAAGT GAGAGACTGC CCCATATGTT AGTGCAATAT 
1851 GTCACTGGAG GTGAAGCATC AGTTGTATTG GTGGGAACCT GCCGTTTGCT 
1901 GTCCCCTTTT TCCTCATGCC TTTTCCTGCC TCTCTGATCT TTTCTAGGTC 
1951 TCTGGCCTAT CAGGAGGACA ACTGGTGCTG CAATAGAAGC CAGTGGCTAA 
2001 GTCTCGTGTA TGGCGTGGTT AAGGTTGCAG CCTCTCACCT CTGCCTTCCT 
2051 CCATTTTGGG CTGGTTACCT TTGTGCTCTT CCTGAATGGT CTTCGAGCAG 
2101 AGGCTGGTGG CTCAGGGGAC GTGCCAAGCA CAGGGCAGAA CAATGAGTCC 
2151 TGTTCAGGGT CATCGGACTG CAAGGAGGGT GTCATCCTGC CAATCTGGTA 
2201 CCCGGAGAAC CCTTCCCTTG GGGACAAGAT TGCCAGGGTC ATTGTCTATT 
2251 TTGTGGCCCT GATATACATG TTCCTTGGGG TGTCCATCAT TGCTGACCGC 
2301 TTCATGGCAT CTATTGAAGT CATCACCTCT CAAGAGAGGG AGGTGACAAT 
2351 TAAGAAACCC AATGGAGAAA CCAGCACAAC AACTATTCGG GTCTGGAATG 
2401 AAACTGTCTC CAACCTGACC CTTATGGCCC TGGGTTCCTC TGCTCCTGAG 
24 51 ATACTCCTCT CTTTAATTGA GGTGTGTGGT CATGGGTTCA TTGCTGGTGA 
2501 TCTGGGACCT TCTACCATTG TAGGGAGTGC AGCCTTCAAC ATGTTCATCA 
2551 TCATTGGCAT CTGTGTCTAC GTGATCCCAG ACGGAGAGAC TCGCAAGATC 
2601 AAGCATCTAC GAGTCTTCTT CATCACCGCT GCTTGGAGTA TCTTTGCCTA 
2 651 CATCTGGCTC TATATGATTC TGGCAGTCTT CTCCCCTGGT GTGGTCCAGG 
2701 TTTGGGAAGG CCTCCTCACT CTCTTCTTCT TTCCAGTGTG TGTCCTTCTG 
27 51 GCCTGGGTGG CAGATAAACG ACTGCTCTTC TACAAATACA TGCACAAAAA 
2801 GTACCGCACA GACAAACACC GAGGAATTAT CATAGAGACA GAGGGTGACC 
2851 ACCCTAAGGG CATTGAGATG GATGGGAAAA TGATGAATTC CCATTTTCTA 

2 901 GATGGGAACC TGGTGCCCCT GGAAGGGAAG GAAGTGGATG AGTCCCGCAG 
2951 AGAGATGATC CGGATTCTCA AGGATCTGAA GCAAAAACAC CCAGAGAAGG 
3001 ACTTAGATCA GCTGGTGGAG ATGGCCAATT ACTATGCTCT TTCCCACCAA 
3051 CAGAAGAGCC GCGCCTTCTA CCGTATCCAA GCCACTCGTA TGATGACTGG 
3101 TGCAGGCAAT ATCCTGAAGA AACATGCAGC AGAACAAGCC AAGAAGGCCT 
3151 CCAGCATGAG CGAGGTGCAC ACCGATGAGC CTGAGGACTT TATTTCCAAG 
3201 GTCTTCTTTG ACCCATGTTC TTACCAGTGC CTGGAGAACT GTGGGGCTGT 
3251 ACTCCTGACA GTGGTGAGGA AAGGGGGAGA CATGTCAAAG ACCATGTATG 

3 301 TGGACTACAA AACAGAGGAT GGTTCTGCCA ATGCAGGGGC TGACTATGAG 
3351 TTCACAGAGG GCACGGTGGT TCTGAAGCCA GGAGAGACCC AGAAGGAGTT 
34 01 CTCCGTGGGC ATAATTGATG ACGACATTTT TGAGGAGGAT GAACACTTCT 
3451 TTGTAAGGTT GAGCAATGTC CGCATAGAGG AGGAGCAGCC AGAGGAGGGG 
3501 ATGCCTCCAG CAATATTCAA CAGTCTTCCC TTGCCTCGGG CTGTCCTAGC 
3 551 CTCCCCTTGT GTGGCCACAG TTACCATCTT GGATGATGAC CATGCAGGCA 
3601 TCTTCACTTT TGAATGTGAT ACTATTCATG TCAGTGAGAG TATTGGTGTT 
3651 ATGGAGGTCA AGGTTCTGCG GACATCAGGT GCCCGGGGTA CAGTCATCGT 
3701 CCCCTTTAGG ACAGTAGAAG GGACAGCCAA GGGTGGCGGT GAGGACTTTG 
3751 AAGACACATA TGGGGAGTTG GAATTCAAGA ATGATGAAAC TGTGTAAGTA 
3801 ACCTTCCTGT ATTCTGCCCC TCCCTGACCC CATCTTTTGC CATCTCTTTC 
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3851 TGTCTTTCTG TACTGCACTT TACAACATTT CCTTGTGTTT GTGTTAATGT 
3901 CAAACTTTGG TTCCATCACA GGTATGCAGG ATCAGCAGAC ACCACTGGAC 
3951 AGGTTCTGCT TCCAAACTCT TCTTCAGTTT TCTCACTTTA AATTGTTTCT 
4001 GGGCAAGGAA TCCTGTGACA AGAGCTAAGG ACACAAAACA TTTTCTTCTC 
4051 TGAAACACAA AATGATAGCT GGTGGAGCTG TGGGATGACA GAAGTTTTGT 
4101 GATATCAGAT TTTGGAGAAT TCTTGTGACT AAGAAGGACT AGAGAACTGC 
4151 TTGGGCCTCT TCTTCCTCCC TTCCTCATAT GAAGGGTATC TATGAGCTTT 
4201 GAAACCAATC CTTTCCATTC TGGGCAGCAA TAGCCCATCA GAACATTCTA 
4251 AAGAAAACAA GTGGCATTGG CTTTGTTCCC TGGTACTATA TTGCCAGTCT 
4 301 CACTGTGTAA CCAGATTCCA GGCACGTCTT CTTTAATTTG GAAATTGCAA 
4351 AATTGATAGA AATTTAGCAA TCTTTTTAAA TGACCATAGA CTATTTAATG 
4 401 GTGTGAGGCT TGCCCAGCCT AGTTGAATTG AGTCAGTATG GTTTGGATAC 
44 51 TGGAAAGTAT CTTGGAGAAG CAGAGCTCCC AGGGCAGTGG CTACTTGTCT 
4501 TTAGTCACAG GTCTAAGCTC CAAAATCTGG TGAAGCAGTG AAGGAGAAAC 
4551 ATCCTAGGAA TTGTGGGAGG AAATATATCT TCTGTGTGGT CCTCTCTTTT 
4 601 CACAGTCTAG GACTCTCCTG AAGTACCTCT TCTTGGGCTA CTGCCCCATT 
4651 CAGCCCTTCA GAAACTGTGG GTATTACACT TCTGTCACCT CTATTACCCT 
4701 AAGGCCTCTG CCCATTGAAC CCTCTTGCAA ATTGGTTATT CTGTCCTTTT 
4751 TCCAGTTGGA TAGCTTTAAA AGGGAAAGCA GAATGACTTT CCTCAGGATT 
4 801 TGTAGCTTAT GAGAAAGTAG ACTTTCTTGG GTGGCCTAGA AGGTTGGAGA 
4851 AGACAAACGG GAACTTCCTC TGAATGACTG AACATATCCA CAAATAATAA 
4901 GCGTGGCAGG AGATGGTGTG AAGAGTAAAA GGAGCATATA GGAAGTTGTG 
4 951 TGTGTGGGGT GTCTGTTTCA AGAACCTGCT AATTATACCT TCAGTAAGAA 
5001 ATGAAGCCAT ACAACCTCTA GAAGAGGAGG AGGAAGGAAC TCATGGAAAA 
5051 GTGGGGAGCC ATAGAAGCTA GGGAGAGGTG TCCTAGGAGT GCTTCTGCCC 
5101 AGGTCCAGCC ATGAGACAGA GCTCAAAAAG AGCTGGGCAC TGCTGGTGAC 
5151 AGAACTGAGT GACCCGGGGG ATCCTGCATC TGTTCTTACT CAATCCCTTC 
5201 TTAATAATGT GACTTGGGGC AGGTCATTTA TTGGTTCTGG AACTTAACTT 
5251 TCTGATATGC AAACTGGGAA TAACAATACT TTCCTTGCCT GGAGGCAAGG 
5301 TCAGTCCTTT TTGCAGTTCC TTCCAGCTCT AAGATTTTCT GAACCATAGA 
5351 CATAAGCACT CAGTGTAGGT CATATTCGCA CTTGCCAAAA ATGGATCAGG 
54 01 GAATATTGTC TCCTGAAGGG AAATGGCCAT TGACAAATTG ATTT AT TAG A 
54 51 GCTCTGTTTA GTCATTTTGC TGGGAAGGAT AATCATTTGT TAACGTAAGT 
5501 AGAAACCTGT GCCTTCTGGA GAATACTATC CATTTATATG TACTCTGGGG 
5551 AGAGTGTTTA T AC AT AC AAA TGAAGGACAG GGCTTCACTG GGAAAACAAA 
5601 CTCCATGGAA TTTCACATGA TTATCGCGAT GTCAGTGTGG AAGAAGATAT 
5651 GGTAAGGCAT TAAATGACAT TAAGACCACA AAATTTGCCA TAATTTGACG 
5701 GACTTGTGGT TCTTCTGATT CAGAACCCTT TCTACCCATG TCACGGATAG 
5751 GTAGTTTTTC AG AG ATC AG A GGCTTAGTTC ATTCTATTAA TTTCCTCATT 
5801 CTATTAATAA TCAATTATGC ACCTAGGGTC TCTGAATACG ACTAAACCTT 
5851 CCTCAAACTT ATTTGCATTT TCAGTTTGTA TAATATCTTG GTGCAAATGA 
5901 GCCTCGCAAA TGATCACTTC TGGGTAATAC TCATTCTAAA GGTATGTCAA 
5951 CCTTGAGAAT TCTGGTCTAG ATATTCTAGG GTTTGGTGAA CAAATCTATG 
6001 TTCCCATCCA TCCCTTTTCA TTTATTTTTT AGACTTCATT CATTGCAGAA 
6051 TAATGAGTCC AAAACCTGCT CATCTGTTCT CACGTGGCAC CCCTATTCTT 
6101 GATATTTTAA ATTGCAATTT TACAACTAGA GGCAGTATTA CGGAGCAGAA 
6151 AAATCGTGGG TTCTAAGTAC TCTGGGTTAG GATTCTGGCT CCACTACTGA 
6201 TTTAATAATG TAGTTTGGGG AAATTTTATT AACCTATGAA ATT ATTT CCT 
6251 CATTGGCAAA ATGGGGATAA TAATATCTCT CTTGCAGGGC CATTATGACG 
6301 ATTCAAGGTA TTGTATGCGG TGTACCTGGT ACACGGTATA TGCTCAGGAA 
6351 ACAAGACTCT TCATAGTAAT ATTGACGAAT TAACAATATT CTTCAGAAGA 
6401 CACTGTGGAG TTGTTTAGGT TACTTGGCTC TTTGTGTGAC CCTAAGTAAT 
6451 GAGCATGCCA GTTTGGGGTT ACTATGAAGA GTACTTACCT AAACTCATAA 
6501 AAT AT TAG AG CTAGAAAGGA CCTTAGAATA TCTTCTGCAG TCATGGTTCT 
6551 TAAATTTTAA TGTGTTGCTC AATCATCCAG GGATCTCACT GAAGGGCAGA 
6601 TTAGGATCCA GGAGGTCTAG GGGAGGGATT GAGATTCCGC ATTTCTAACA 
6651 AGTTCTGGAT GCTGCGGGCC CCAACTTAGA GGTGAAAGGT TCTGAAGCTC 
6701 TTGACCAAAC CAGGAGACCC AGCAAAGAAG TGGTTTTTCA GACAACTTGC 
6751 TTAATTGAAT AATGATTGTT TGCTCTTTAA TTCCAACTTT CAATGCCAAT 
6801 TTAGCAAGAA CCAGAGGCTG TGCTAATTGC CACACCAGTC TGGAAACCGA 
6851 AATGGATAGC TTCAGGGTAC TTGGACAAAG TTGGAACATC TGCTTTCTAA 
6901 TCTCTCCCTC TTTGTATAGC TTTATTTGCC TACCAAGCCT GGTAGTATTG 
6951 AAAATCTGCC CTCACTATAC TCCCCTAAAT ATAATCAAGT TGAGGCCAGG 
7001 CCTGTGCTCT ATC AAT A AT A TAGGATCCAC GAATTCACAT GTTTGGTTTT 
7051 ATGCTTTACT TCTTCAAAGG TGCTTTTAGC AGCATGGAAG AATGGAAAAG 
7101 CACGAGCTTT GGAATATGAA AGCAGATGTG AATCCATCAC TTACCAGTAA 
7151 CTTTTAACAA GTCACATCAC TTTTCTGAGT ACCAGGTTTT TGTTGGACAA 
7201 CAGAAATAAT ATTCTCTATC CTTCAAGGGA ATACTAAATA TAAGTATGAG 
7251 AAAAATGCAC AGTGCCTTCT CGTAGATGGT GTTCAGTCAT TCAACAAACA 
7301 TTTGTTAGAT ATTTGCTATG TACTAGCTAC ATTACTAGGC ACTGGGGTTA 
7351 AATAAGTGAA TAAGACAAGC TGACATTTCA GCGCTCAAGG ATCTTACTGT 
7 401 CAAGTGGAGA GGATCAAAGG GTACAGACAA ATCAAGGAAC GTGAGAGAAG 
7 451 TGGTATGGCT GAGATGGATT GAATAAAGGA GCAATGAGAG CTCCCTGCAA 
7501 TGTGTGTGGT ACCACTGAGG ATTCTAAATT AACCTTCATT AAGGACTTAG 
7 551 TAGTGACAGA GGTGAAGTGG GGATAGGTAC ATGATTAATT TACATCCATA 
7 601 TTACAATGAA ACCTTAACAT TTAAGAGGGA TATTATTGAT GTCTTCATGA 
7 651 TCCAGAAGAA TCCTCACCTT TGCAACCATC ACTATAGTCA CTTCTTGAGA 
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7701 ATTATGGCCT TTAAGACTGT AGCATGCAAT GACAAAACCT CACAGAGGTA 
7751 TGGGTTCTGC CCGCACACTA ATTTCACTCA TTAAACAAGT GACTGGCTCC 
7801 TATATCCCAG GCTCTCAGCA CGCCTTTGCA AAATAACAGA TTATTGCAGC 
7851 TCTTGGACCT TTGATGCCTC TGGGAATAGT CAAAGCCACA GATGTCAAAT 
7901 ATGTAAATGC CAAGATCTAT TATAATTAAA TAGTGCAGGC CTCCTTCAAA 
7 951 GAAAAAAAGC ATGTTGGCTG TGCTGCACGT TCTCCAACCA AATCAGAATG 
8001 TTAAAGCTCG AAGGTATCTG ACCTCCCATT TTTTAAATTA TGAAGATGAA 
8051 ATTCAGAAAG GGAAGGTAAC TTATCCAAGA TTACATGGCT AGCTATGATA 
8101 GAAAGTTAGA GTTGGAAAGG ACGTTAGAAA GTGAGGGTTT GAAAGGACTT 
8151 TAGAAGCTGC TTATTCAATG TTCTCTCTGC CCTTTCCCAT CTTAGGCTTC 
8201 TCCATTTTAC TTTTATCCAT CAATAAAATG TTAACTTCAA AAAGAATATG 
8251 GCAATTCTTG GGTAAAAGAT GCTCTGGAAG TGTGAGTCCG GGAGTATTAT 
8301 GTGACTAATG TCTTAACTAA GAATAATAAT ATATTATGGA CTAGTTTTAA 
8351 TCTCTTGTTT CACCTTGAAC TGTTCAGGAA GGAAAATAGC CCACGGAAAT 
8401 TTTTTAAAAA GTCTTTCTCT ATCTGAATTG AGAAAAGGTG ACAGGCATAG 
84 51 TTGGAACATC TTTTAGGCAG TGCTGGTGAA CTTCAGGCTA GGCCTTGTTC 
8501 CATGAAATAA TAAAAATTTT CAAAATAATG CAGACCATTC CCTTCCAGGG 
8551 ATGCTTTCTC TGTAATGTTT TAACCCCAAG AAATCTTTCT GTAAAAATCT 
8601 ATAAAAATCT GGAGTGTTCC AGGATACAAT TTGCACATTC TCCAATTTAA 
8651 CTAAAACACA ATCGATTTTT TGTTTTCTTT TTCTTTGGCT TAGCAAGGTT 
8701 TTAAGATAGT CTCTTTCTGG C C AC AG AGGG AGATGATTTG CCTCTAGAAT 
8751 ACCCTTTCTG TGCTTGAGAG AGTCACAAGA CTGCAAGCTC ATGGAGGATG 
8801 AGAGTCAAGT AGAGGTGGTG ACATCTCTCC CTTGGCCAAC ATCCCTCTCT 
8851 TTCTCTTTCC TTCTGCCTTC AGTGGCAGTA GCAAAAGTCC TCCTTCTCTT 
8901 TAGGTAGACA GTCAGCCACT ACAACTGTGG CTTCCTGAAA TCCTCAGTGG 
8951 AGCTATGTAC TTGGCACAGA TTTGTCTTGA AGAAGGGACT CCATTTCTGA 
9001 GCCAGTTGTT GAATGGGGAT ACTTAGCAGT ACAGTGAGGC ATTTCCAGTA 
9051 GGATTGTTCA ACCACAATTG CCCACTTTCC AGGCCCAAAG GAATAATTGA 
9101 AGGCTATGTA GACTTTTTTT TTTTTTTTTT TTTTTTTTTT TTTGAGATGG 
9151 AGTCTCGCTC TGTCGCCCAG GCTGGAGTGC AGTGGCACAT CTCGGCTCAC 
9201 TGCAAGCTCT GCCTCCCGGG TTCACGCCAT TCTCCTGCCT CAGCCTCCCG 
9251 AGTAGCTAGG CCTAATATAT AT AT ATT AT A CATATATATT TAT ATT TAT A 
9301 TATATATATA CCACCACGTC CGGCTAATAT ATATTTATAC TTTTTTTTTT 
9351 TAGTAGGAAA GGGGTTTCAC CATGTTAGCC AGTATGGTCT CGATCTCCTG 
9401 ACCTCGTGAT CCACCAGCCT CAGCCTCCCA AAGTGCTGGG ATTACAGGCG 
94 51 TGAGCCACCG TGCCCGACCA TGCTATGTAA ACTTTTTAGC AG AAGCTTT A 
9501 GCTATTGTGT CCCGAAGGGC CCCAGGTCAT GATGAAATGT CTTTTTTTTT 
9551 TTTTGTCTCT TTTCTTCTTA ATTACTGAGA CTGTCAAAGA ATATGTCAAA 
9 601 GCATGACATA TTCCAACTCC AGGATCCATA AAACACCCCA AGTTCTGTGG 
9651 AGACCCTATC ACATCTGCAA AACTCTCCAG GAAGTCCAGA GCCCTCCTGG 
9701 TTAATTTGTT TT AGGG ACT A GGCATGCGGT ATCCCCTGAC AACACTGGAT 
9751 CAGCAATTCT CCTACCTAAG TCAGTCCCAC ACCATGTGCA GCAGAGTATC 
9801 CAGTGCCCCT GCCCTGGTCT GCTCACATTG GTTTGCTCTC CAGAATAATA 
9851 ATTCCTCAAT ATCCACAAGA GATTGATTCC AGAACTACTC CGAGGATACC 
9901 AAAAATCCTC AGATGCTCAA GTACCTGGTA TAAAATGGCA CAGTATTTGG 
9951 CATATGACCT AGGCATATTC TCTCCCATAT ACTTTATTTA TTTATTTATT 
10001 TCGGGACAGA ATCTCATTCT GTCGCCCAGG CTGTCACTCG CTTATTGCAA 
10051 CCTCTGCCTC CCAGGTTCAA GCAATTCTCC TGCCTCAGCC TCCTAAGTAG 
10101 CTGGGACTAC AGACGCATGT CACCACGCCT GGCTACTTTT TGTATTTTTA 
10151 GTAGAGACAG AGTTTCACCA TGTTGGCCAG GCTGGTCTCA AACACCTGAC 
10201 CTCAAGTGAT CCGCCCACCT TGGCCTCCCA AAAAGCTGGG ATTACAGGCG 
10251 TGAGCTACCA CGTCCAGCCC C CC AT AT ACT TTAAATCATC T CT AG ATT AC 
10301 TTATAATACC TAATACAATG TAAATGTTAT ATAGTTGTTT TAATGTATTG 
10351 CTTTTTTTAT TTGTATTGTT TTTTATTGCT GTATTATCCT TTTTTATGTT 
10401 TTATTTTTTC AAATATTTTC TACCCGTGGC ACCCACAGTT GGTTGGTGGA 
10451 ACCTGCGGTT GGTGGAGCCC ATGGATGTGA AGGGCTGATA GT AT GAG AAA 
10501 ACTCAGAGGT GCAGAGTTGG AGAGCACATC GGGGAGAATG TCAGCATGGG 
10551 TTAAAAAAGA CACACTGTGG TTGGAGATGA TCACATGAAT GGCCACTTCA 
10601 AAAATGAATG GGTCTCATCC TCAAAGCAGG CTCTCCTGGG CACTGCTTGG 
10651 GAAGGTGCTA ATTGGAGCTT CAGGCAACAA TAATAAGGGG AT AC AGGTG G 
10701 GGATCCTGCC ATGGGCGTAG CTTACTTTCT CTGGACTCTT CTGGGTCTTA 
10751 AGGCCAGTTT CCTCATCCAC TCAAAAGAAT GACAGCAAGG TGAGCAAAGC 
10801 AAGGCAGGTA AATGAGGAGG ACTCTTTCTG GCTGTCCAAC TTTTCATCAA 
10851 CTTCCCAAAG GTTTTTGGAT GGGACATGAG CACTCATTCC TTCTCCACCC 
10901 TTTAGCTAGG CCCTGTCAAC TCCAGGAGGA AGGTAGAAGA GGTCAGAGCT 
10951 GTGGTCTTTC ACTTATTCAA GATGTTTCCT TAGTGTTTTG TGTTTGGGTT 
11001 TTTTTTGTTT TTTTTTTTTT GACAGAGTCT TGCTCTGTTG CCCAGGCTGG 
11051 AGTGAAGTGA AGTGGCATAA TCTGAGCTCA CTGCAACCTC TGCTTTCGAG 
11101 TTCAAGCGAT TCTCATGCCT CAGCCTCCTG CATAGCTGGG ACTACAGGCA 
11151 TATGCTACCA TGCCTGGCTA ATTTTTGTAT TTTTAGTAGA GACGGGGTTT 
11201 TGCCATGTTG GCCAGGCTGA TCTCAAACTC CTGACTTCAG GTGATCCAGC 
11251 CACCTTGGCC TCCCAAAGTG CTGGGATTAC AGGTATGAAC CACTGCACCT 
11301 GGCCCCTTAT TGTTGGTTTT TAAAAGAGAA ACTAAGCTGT GCTTCCAGAA 
11351 CCCAGTTTGA GAAAGTTTGA AGACCTGGCA TAGAGCCAGT GACATATAAT 
11401 TGTTAGTTGA AGAAAGAGAG CTCCTTGATC TGCAAATAGA GCACGGCCCC 
11451 ATATTTAAAT TCTGCACATT CT AG AAGC AT TTTGCAAGAA TCAAATGCTT 
11501 TGAGGATTTT GCTAAATAAC CATGGAGGAA AGCACTAGAC AAATATTTTC 
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11551 AGATGGCATG AGAGTTATCA TTCATAGGAA TTATATTTCC ACTCCTACCA 
11601 CTTACTGGGG ACCCAAGTAA GAAATTACTT GGATAAGCAG AGGAGAATTT 
11651 AAAGTTGAAT GTGGTGGAAC TTATTATGGA AAAAATATGT TTTTCTGAAA 
11701 ACTGGATATG TGTATATATA TAAGTTCAGT TGTCATTTTG GAACCATCCT 
11751 TACTCTTCCT AGCTAAGGAT TAGCATACAT AGGTGCAACT TGACTAACTC 
11801 TGCCTGGACC CAATTCAGTT ACCTTTTGGT GGGTAGGGTT CATGAAGAAG 
11851 CAGTTATTTG TGGAGTGTAT AGAAACCACT CTATTGTAGG TTCTTTAGTT 
11901 GGTACTTTCA AAATAAGTGA CATCCAAATA GTAACTTAAT ATTCCAAATA 
11951 TGGCTGCAAA ACAAATTGTC G AT T ATGG AT GACTACTACT GCCATCTCTC 
12001 CATACCAGTC CATCTTCTGC CAGGCTGTTT GGTCTTGATT TGTCGACCTT 
12051 TTAGGTTTCT CCCCATGTAT TCCACATGAC CTTCACCAAC CCCACTTCTA 
12101 TCTCCAAACG TCTTTCTGAG TTGTGGGGAT GCAGATGTAT TCTGCCACCA 
12151 TCACAAGGGC TAACCGAGCC CTGGCTGCGG ATCTTCATTG TTGTTCACAT 
12201 TATTTCCATT CTTACACCCT ACTTCATGTT TGTACACTAT TTTCTTACAT 
12251 TTGCTGTCTC TTCTAAACAT TCTTTGCTGC ATCCACTTTT TCTCTATTTG 
12301 TGCTCTAGGT GCTGCAGAGG CTAATGCTGG GTTTCCTTTC ATTCCTCCTT 
12351 GCACTCAGCA CCTCCCTTCT CAATTCCTTT TGCCATGTCT CCACTTTAAA 
124 01 TCTTAACCTA CTCCAGATAG TCTTTTCCTT CACACTATTG GCATCTGTGC 
12451 TTGGGTTGCT TTCAGTCTAT TCTCTGATCT ATGATTTCTT TGCATGATCA 
12 501 A G AAGGTGC C ATGAAAGGAT CCCTTAAGAA AGCCTGTCAT TTAGCCAGAA 
12551 CGAACTAGCT TCATGATAGC ACCAGGAAGA CTGATATCTC CCAGGAAACA 

12 601 AACCACTCAT GGTGGTGCTC TTTTTGCCTT CACTATGAAG TGTTTGTCTG 
12C51 CCTGTATGTG AAAAC G AG AG GGTTTAATTG TAAGGATGCA GCACAGATTG 
12701 GGACTGGCAT CAGAAAGCCA TTGGGGACTG AGGTAGCTCT AGAGACCGCT 
12751 TTCTGTCTCC AGTGCTCTCC CTCCTGGGTG ACATGTTTTC TGTCTCCTGG 
12801 CATCTCTGCT TCTCTCTATG GGCTTCTTTA TTATTTGCAG CTTGCAATGG 
12851 TACCCCAAAG TCCTAGCTCA TGGCTCCTCT CTGCATATAT GCTTTCTGTT 
12901 CCTACCCACA AAGCTCTTTC TATTCTTCTA GTTTAAATTT TCAAGAGAAG 
12951 AAATCTGATT TTTTTTTAAC CTGGTCATGT CAAAGACCAC TGACCACATA 
13001 TGAGCTGGTT GCCCTGTGTC AAGTGCCCCC TTCTCCCACC CTCTTCCCCT 
13051 CCCCATCTGG TCTGTCATAA CTGAATGATG GAGTGGGAAA TTGAAATTGC 
13101 CATGGGAATT CCATGATAAG CTATCTAAAC AGTTTTATCT ATAAGTGGTA 
13151 GACAGAGTCA CTTAGAAGGG AGTCCCAGGT GAGACAGGCA CCTGTCAACT 
13201 CCAAACTGGC ACACATTCTA AGGTCTGCAA CACCCCAGAG AGAGCAC TGA 
13251 TTTTGTAGTG GCCTGTACTG GGGCGGTAGG CTGGAGAATG GGAGAAATAG 
13301 CCACTTCAGA ATCCCCCAGC CCAAATGCAT CAAGCTCACT ATAGACTCTG 
13351 CAGCCACGAT TCAGCTGGCT TCTGCTCAGA TCAACAGAAA ACATTCTTAG 
13401 TGAATGATGC TTGTGGCACA TATCTCAAGG CTACCAGGGT CATTTCTTCC 
134 51 CATTTACTTT TTCTCTGATC TATCCTCTCC AGGACACTAG CGTCAGAAGA 
13501 TAATCTTCCG TCGTTTTCAG GTACACTATT TGGGTACTGA GTCACTTTCA 
13551 AAGCCTCTTT CTGGGTTTGG ATTTCCAGAG CAGCCTGTGC TGTAAAGCAA 

13 601 GACAGAAAGC TTCCCTGCCA TTCATGCCTG CCAGGGATAG AATGACAGTA 
13651 CTCCTGAGGC TCTCCCTCCC CACCCCTCCC CTGCTGGACA GCTGATCTGC 
13701 TGGACTCAGC CAGAGCCAGC AGGCACCCCC TCTTTATCCT AGGAGCTGCA 
13751 AACTTGATGC CTTTCCAGGA AATCCCCAGA AGCTGGAGTA TCCTCATCTA 
13801 CATGTGGCAC AGTGTATGGT TGTGTCAGGT GCTCATGTCC CATTGCATAG 
13851 GACTGGGGTG GAAAATAGGG ACCGTCCTTT TGTGTCAGCT CCAGTCAATG 
13901 AGTAGTGGCC ATCCAGGGGG CCATCTTGGA AAGGACTTGT GAGGCTGTAT 
13951 CTGCGCTCAG TTGTAGATGT GAG AAG A A A A GGCCAAATAT CTGCCAATCC 
14001 TAGTCCTGGG ATTCAAGATA GAAAGAACTG CATGGAGTGA AGAAACTAGG 
14051 AGTCTCCATT TCACTGAGAT GCATAAGAAT GAAATTATTG TCACTATTTC 
14101 TTCAATACTG GGCCAATCCT AATAAGAAAA CCCTTTTTGA GTCTCTCTTT 
14151 TCTTTATCCT ACATATAACA CAGAAGCTTT TTCTATTCCC TGGATGAACC 

14 201 CACAGGGACA GAAATTCTTG TTGGACAGGT GAAGCAGATA ATTTCTTTAT 
14251 CAGACTAGAA TCTTCCAGAA GCACTGCTAA CCTAGTGAGT TTTGTACTCT 
14 301 AGACAGGTGG TTCTCAAGCC AGCTCCCCAC CGCAGGCCTT TTTCATGGTC 
14351 TGCCCCTCCC TGTGGAACCC ATGTTTTAGG TTATTAGCTG ATAATTGGAT 
14 401 TTCTATTTTT TCTCATAAAA TACAGCAAAA GATAGCTAGT GATATTATGA 
144 51 TGAGTTAATG TAATTATAGC CAAAGCAGAG AGAAACAACA TTTTAATTAA 
14501 CCTGTGTGGA CTGCTGGAAG AATATAAACT TTCTATTTTG GGGGTTGAGT 
14 551 AGAGACAGAA ATGAACACAG CCAAGGGCTG ACTGTCAGAG GACATTTAAC 
14 601 TGATGTAAAA TGCTTTGAAA TTATTGGGCA CTCATTGTTT AAAGTTGTTT 
14 651 TTGATGATGG TAACTCCGTA AGGGGATCAG AACATGCTGG AAAGAATGGG 
14701 CACAGCTTTG GTTACCTGGG CCTTACCACT GTTATTCAGG CCTCTGAGAA 
14751 AGCTTACTAT TGTTGTTATG TTTCTTACAT AATAAAACTT CTAATATTTG 
14 801 TATGAAAACA TAGAATTCCA CTTTTAAAGA TGTAAGGATT TTGTCATACC 
14851 ATTAGGGTTA CTATGATCAC TTGATTCTAG GTCTAAGAAA TATTAAGTAA 
14 901 TTTACCCGCG AACACAGAGT TTTAAGGGTA AGTATCAAAA CCTTGATCTT 
14 951 CTAATACCAC ATATTCTCAC TCATATGTGG GAGCTAAAAA TATTGAGCTC 
15001 AAAAAGGTAG AGAGTAGAAT TGTAGTTATT AGAGGATGCG AAGGAGGATA 
15051 GGGAGAGGTT GGTTAATGGA TACAATGTGA AGTTATGTAA GAGGAGTAAG 
15101 TTCTAGTGTT TTGTAGCACT GTAGGGTGAA TATGGTTAAC AGTAATTTAG 
15151 TGTATATTTA AAAAAAAATA GACAGGATTC TGAATATTCA CAAAGAAATG 
15201 ATAAATATTC AGCTGGGCGT GGTTGCTCAC GCCTATATTC CCAGCATTTT 
15251 GGGAGGCCGA GGTGGATGGA TCACCTAAGG TCAGGAGTTT GAGATCAGCC 
15301 TGGACAACAT GGTGAAACCC CGTCTCTACT ATAAATACAA AAAATTAGCT 
15351 GGGCATGGTG GCGCACACCT GTAGTCCTAG CTACTTAAGA GGCTGAGGCA 
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154 01 GGAGAATCGC TTGAACCTGG GAGGCAGAGG TTGCAGTGAG CCGAGATCAC 
154 51 GCCACTGCAC TCCAGCCTGG GTGACAGAGT GACACTCTGT CTCAAAAAAA 
15501 AAAAAAAAAA GAATGATAAA TATTTAAGGT GATAGATATG CTAATTACCC 
15551 TGATTTGATC ATTACACTTT GTATACATGT GTCAAAATAT CACTCTGTAT 
15601 CCATACATAT GTATAATTAT TATGTGTCAA CTAAAAATAA AAGGAAAAAA 
15651 ATCATTTCAG TGTATTTACA AAACATATGT AACCATTAAG AATAATGTTT 
15701 TAAATTATAT CTAAGGGTGT GATAAAATTA CAGTATAAGA TTGTGCTTGA 
157 51 AAAAGTGCAA TAAGAAGTAA ATATGTACAG ATGAGAAAAA GTGCAAAGAA 
15801 CTAAGTCCTA AG C AG AC TAT ACCTTTCCTA CTGCATGGTA CTTCTCTGGC 
15851 CTTTTGCTTT GAAAGATTTT GCACCCAGCA TGGCAAGTGG TTAGCAGAGG 
15901 CAGCCATTCT CACTTGTGCG TTGGCTTTGG GAGCCATATA TGTTGTTCAG 
15951 CTGGGTGTGG AGTGGAAAGG CTGCATGTTG TATTAATGCA TTGTTAAGAA 
16001 CCTCTAAGAG TGATTTCTTT TGGGAAGTGA GACTGACGGT CCGAATGGTG 
16051 GAAAGACAAC TTTTAATCTT TTACTTTACA CTTTGTGCAC TTTTAAATGT 
16101 TTAACATGAG CATGCATTTC TTTAATAATA AAAATACAAA AAAATTTTAG 
16151 CCCTAGATCT TCTGATTTTA AACTGCATAT TCTTTCTATT GTGTTACATA 
16201 TTTTAGCATG AGAATAAGGT TATGAAGCTG GAAGTAGCAG GCTCCCTTTT 
16251 CCTCATATGT AGGAAGTTAA GAATGCATTC TACGTTTCTT CTTTAAGGAG 
16301 TTGGCTTCTT TCCTTTTAAC ATAGGGGTAA CTGGGCCCAG GGAGTTT GGC 
16351 AAGGGCCAAA TAAAGTCCTT AATGCCCAGC TCAGAAATCT GGATTCACCA 
16401 TCCTTGACTG CTGGCTCCAA CCCACCCTCA CCTGAGCTGG TCTGCAGAGG 
16451 ATTCTTGTTT GTGTCACTTC ATCACCAGCA ACTACCGACA GATGATGCTT 
16501 TGGCCTGCTG CCTGGGTAAC AGGGCGAGGC TGGCTCAGGA CCATGTTTTC 
16551 AGATCAGGGG ACCTCCTTTG ATGC CAT GTC CATGGTGTCC GAGGGCAGCC 
16601 AGGATCAAGG GCTAGACGGG GCAGTGATGA GATGAGAGCA GGAGGGGCTC 
16651 AGCTGCAGCC CCAGGAGAGC CTATGCCAGC CCTGTTGACC AAGGAGGACA 
16701 GAAGCAACAG GAGAGCGGAG GCAGAGGGGT GAGTGTCTAT CGCTCAATGT 
16751 ATAATCGGCA GACATTTGGG GAGCTCATAC TGTGGGCTAA GCACAGGGAA 
16801 GAAAGGCACA GTCCCTGTCC TCAGGGAGGT CACAGTTGAT AGGGAAGACA 
16851 AGCATATGTG CTAGCTGCTA TAGAAGGGGG AACCACTGAG GGCTGTGGCC 
16901 ACACAGAGGC AACACCCCCT TCTTGTTTTT TTGTCAGGGA TTCAGTTTGG 
16951 CGTCATTAGA AGTGACTTGC ACAACCCCCT CCTCCAGTCA ATTCAGAAGG 
17001 ACTTGTTAAG CAGGAATGAT GAATTAGCTT CAGCTTGTGG GGC AC AC AC A 
17051 GATGGAAGTA TAAGGTGGCC TCAGGAGTAA GTAAATCCCC ATGCAAGCTG 
17101 TGTCCTTAGA CCAGAGCAGC ACCCGGTTCT TCCCCATTTC TAGTAAAGGT 
17151 GCCTCACACA CCACCAGGAC AC AATT T AT G CCTGCAGAAT GAATGAATGA 
17201 ATGAATGAGT GAATTCCTGG AACCTCTTCT GCTTATGTGC CACACCAGGT 
17251 TGCAGCAAGC CCAGGGACAC CTGGGACTGG AATTGGGCTC TCAGGTGTAA 
17301 GGACCAGGGA GCACCCACCA TTTTGCATTC TTCAGCCCTT CCTCCTCTCC 
17351 TGTCCCAGCT TCAGCAATAT CCACAGAGCC CTCTGAGCAA CTCTGAGCCT 
17401 CTCCACAGCC TGACGCCTGC CTGGGCACCA GCTCTTCAGA GGGTGTTTCT 
174 51 GTGCTGCTCA GCTACCTCTG AGCCTGGGCT GCCTTTGATG CTCAGGAGAC 
17501 ACCCTGTAAT TCAATTAAGC CTTCTCTCCA GGGAGCATGT AATTATGTCC 
17551 TATCTGGGCC TTGTAATGAC AGCCCCCTGC CACTCTACAG GGAGTTGCCC 
17601 TGCTCAGCTG CCCAGAACCT TTCCCTGGGA GGAAACTAAT CTGCTTAGCC 
17 651 CAGATTGGAC GCAGTTCTGC ACAGCACTTT TCCGAATGCC TCTGAAATGA 
17701 GTCCTCACTG ACAGAACGGG CCCACTCTGG GGGAACTGAG GGCTCTCTTG 
17751 GTCCTGCACT GCTCTTTGCC ATACAGATCT GTCTGCCCAG GATTTTTCTT 
17801 GGGTGTGTAG GAGGCTGAGA GAGCTCCCCT TTCTTCTCAT GGCTAAATCC 
17851 CTTGGTCTTT CCAGCCCTCC TGGGGGTTAG AAGGGAGAGG GAAAAAAAAA 
17901 AAGACTGAAC TTGTTGTTGT TGTTTTTGTT GTTGTTGTTG TTTGCCTGTT 
17951 TTCTATGTTG TCTTGTGGGG AGAGGGTATA AGATTGATTG ACAGAGTGGC 
18001 ACACTTCCCC TGCAAATTCA TCATTTGAAT TTCTCAGGTA AGATGTTCAC 
18051 ATTTCTCTGT TAAGATGCTC CAATTTCTCT GGTTAAGATT TCTCTGGTAA 
18101 GATGCTCATG AATTGGTGGA GGTGTTGGCG GGATGTGGGA AGTGTGCCTG 
18151 CTCTTTCTGA GTTTTGGGGG AAGTTGCCTT AATTCTCTGC ATGACTTTCT 
18201 TTGCTCCTTT GGGCTTCATT TCTGTGCAAT GTAGTCTGAC ATGAATACTG 
18251 CTCAGGGAGG TGTTGCTTCC CACTGCCCAC GCCACTGGAA ACCAGTAGCC 
18301 CAGGTTTACT CGAGTCCTCC TTTTGAGGAA CCCAAATTCT TTCATTTCTT 
18351 TTATGTGAGA TCTGCCCAAA ATGCCATTGG CAAGCTGTAC TGGGTTGAAT 
18401 AGTGTCCTTC CTCCTCCCAA ATGTATGTCT ACTCCAAACC ACAGGATACT 
18451 ACCTTATTTG GGAATAGGGC TTTTGCAGGT GTAACCATTA ATAGTTATGA 
18501 TGAGGTTATA CTAGATTAGA ATGGGCCCTA GATCCTATGA CTGGTATCCT 
18551 TACAAGAAGG CCATGTGATG ACAAAGACAA AGAATGGAGT GAGGCACCCA 
18601 AGGAACTCCA AGGATTGCTA GGAAACACCA GAAGCTTGGA GGAAGGCATG 
18651 GAACAGATTC TCCTCTCGGA CCTCTAGAAG GAATCAGTCC TGCTGATACC 
18701 TTGATTTTGG ACTTCTAGCC TCCAGACCTG TTGGGGAGAA TACATTTCTA 
18751 CTGTTTTAAG CTACCACGTT TGTGGCGATT TGTCACAGCA GCCATAGGAA 
18801 ACTAATACAT ACAACCTGCA CAATGCCTAC TCCAGCATTC CATAGCAAGT 
18851 CAAGGGCCTC ACAATTATGT CCAAAGGACT GATAGAAGAG CGACCTCTGT 
18901 GCTACTTGTC CCTCAGGACG CTGACCCACA GCTCTCAAGG CAGGAGTAGG 
18951 CCAGAGCTCA TTCAACAACT TTGTTATATA GGGGTTCCAA TTGTAAACCT 
19001 TTTGAATTCC TGTTTGCAAG TAGATGAGGG TTGAAAAATA AATGGCCACT 
19051 TTCTCTAAGC CACATACCCC AATCTGTTTT GTTACTTCAT TACAGCTGTT 
19101 ATAATGGCCT CCTCTTCTAT CTTCCAATCT CCATAGCCCT GGTTCCTTGA 
19151 TAGTTCTTTT TTTTTTTTTT TCTTTTTTTG AGGCGGAGTC TCGCACTGTC 
19201 GCCTGGGCTG GAGTGCAGTG GCACGATCTC GGCTCACTGC CACCTCTGCC 
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19251 TCCCAGGTTC AAGCAAGTCT CCTGCCTCAG CCACCTGAGT AGCTGGGATT 
19301 ACAGGCACCT GCCACCATGC CTGGCCAATT TTTTGTACTT TTAGCAGAGG 
19351 TGGGGTTTCA CCATGTTGGC CAGGCTGGTC TTGAACTCCT GACCTCGTGA 
19401 TCCACCCACC TCAGCCTCTC AAAGTGCGGG GATTACAGGC ATGAGCTACC 
19451 GCGCCTGGCC AGATAGTTCT TAAACAACTG CCCAGAAGTT CCAGCCTAGG 
19501 CAGGGGCAGC CATGAACTGC ATTGCTCATT TCTGCTTTTT GACCTTTTCG 
19551 ATGGCTGAAC TCTAGGCCAT GGAAAACAAG GACCCACTGT ATAGTTAAGA 
19601 GTCATTTTGT G A C TAGGG AG ACAAAAAAGG GCCTATTCTC CAAATCCCCT 
19651 TTCCCTCTGG AGTTCCTCGG TGCCTTAAAG CTTGTCCTGA GCTACAGGTG 
19701 TGTTACCTGC TTATCCCAAA ATGCAGGCAT GTTACCTGCT • TTCCTCTGCA 
197 51 AAGAGAGGCA GGCCTGGCTG GGGCACAGCT GAAGATGTCA AGGCCAACCT 
19801 AAGGGCAGCC AAGCTATGGC TGTCTGTGAC AAGAGGAGAG CAGCGGTGAT 
19851 GGGAGGGTAG GAGGCATTGA GTTCATGTCC GGGTTTGCCT CCTACCCTCC 
19901 TATCACTGCT TGATGATCCT ATCACTGTCT TGATGAGTTC AAGACAGAAG 
19951 TTTGCCTCAT CATTGCCACA ATAAAATCAC CAATAACAGA AGTGTGAAAG 
20001 CAGCGATGTG AGTGGAAGCC CATATATACA CAGGGGGTAA TAGAGCAGCA 
20051 TGATTAAATA TGTGGCCTTG TTATCAGACA GGCTGATTTG GAGTCCCAGC 
20101 TACTTGTTGG TGACCTGAAC TAGAGGAAGT TATCTAACCT TTCATTTTAC 
20151 TCATTTACAT AACATGGCTA ATAATAGCAC CTACCTTATA GGGTTATTGT 
20201 GAGGATTGAA TACAATTATG CAATATAAAA CGTTTAGCAT AGTGCCTAGT 
20251 CTAAATTCCT CACCAGGGGT ATGATGTACT AGTTTTTAGT TAAGTAATTA 
20301 GTATCCTGGA CATGTCACAG CCATTTGACC TATCTGGGCC AGCGTTTTGC 
20351 TCAGGTTCCC CCAGCAGTAA TTGTATTCCC TCCCCAATCC CGGGATTAGC 
20401 TTTTAGGAAG AAACAGTTGA TCTAAAGATA GAAAGTCAGA GTACTGTCTG 
204 51 GAGGAAGGTA GAGGGAAATG TCATTATCTG GGTTTTCTTT GATGATGTCA 
20501 GGGAACATGA CAGGCTGCTC CCAAAGACAG AGCAGCCCCA GGACAGGGAA 
20551 GAAGGTGACC TTGAGGTTGA CTCCTCTGCA TCCCGATGTG GACGTTATGG 
20601 ACTTGTTTTG GAGATGAAGG GAAAGAAAGA TGGAATGTAG AAAGTGAAGG 
20651 AG A AT AAA AG AAGTGGGAGG AAGAAGGGCT GGGAGGAGGA TGGGCAAAGT 
20701 CTTTCTGGTC TCAAGGATAA TTACATGTGA AATCACTTGC CAGTGGGACT 
20751 CTGGGGCTGG AGCAGCTACA ATAATTACAG TACAGGCTGC AGAGGGCTCT 
20801 TGGGCATGTC TTGGAGCAGC CTGTAGGCAG TACTGAGGCC TCTCTCACTA 
20851 GACCCATCTC CCAGATCACA TAGTACACAC ACCTTCCACC CCCGGGCCTG 
20901 TTAATGATCA AAAAGCTTAA ACAGAACAAT TACAGCTTCA GAGTGGAACC 
20951 ATATCTCTGG GCTCCTGTGA TGAAAACCAC AAGCCTGTCA GGCTGGGGCT 
21001 GCTTCACATG GAGGGCCCTG CTCTTAATGG CCAAGTGATC TGGAGCAAGA 
21051 CCCGTGACTC TCCCATAGTG CTGTGGATGG TGCTGCCTCT CCCCACGCAT 
21101 CCCCAGAAGA GGAAGTTCAG TAACTAAGGA ATTAACTATT CTCCAGCCTG 
21151 ATTCTGCTTT TCCCAATCAG GGCTTTATAC CTTTCTTTTT C ATCCC TATA 
21201 TTTGGAGATG AGTCACCCTT GCCTTCATTT TACCTAAGCA AGGCAGTTTC 
21251 CTGTAACCTA ATGAAGTGCC AAACAATACT GTGATTTATT TAGTACTTAC 
21301 TGTGTGCCAG GAATTCCAGC AGGTGTTGGA CATTTATGAT GTATGATCCT 
21351 TACACTAAGC CTGCAATGGT GCAACCCCAG CCCTGACCAC TCTGTGCTTC 
21401 CCTTTTCACA ACACAGCTTG TCACTAAATC CAAGTCAGGA ATTCCAGGTT 
214 51 AGGCTTGAGT TGTGCAGAGC CCTTAACTGA AAT TTGCC AT GGTTGAGGCA 
21501 TGATTGCAAT CACTGACAAC TCCTCCCGGC TCTACACACC TACTTGTCAT 
21551 ATTCACGCCC TGATCACGGC CCCACTCGCA TCTCTTCCCA CTTTAGAAGT 
21601 TCTTTCCTAT AGAACACGTT GCTGCTGCCC TGTTCTGGTC ACTGATCAGC 
21651 CCTGGCCTAA CCACTGGCTA AGCTTTGTGC TTGCACATAG CTGGTTGAAT 
21701 CGTATGTATT GCTGTTTGTG TACATCAAAA AT AT AAT AAT AATATCGGCA 
21751 ATTTTATGTG TTTCATTCAA CATGAGGGAC CCAGCATTCT TACCTTGTCG 
21801 CTTTGTAAAC CCTGCTGCTC TCAAATCTCC ACTAGCTGTT TCCTGAGCAG 
21851 AAGGAGATAA AAGGCTGGCT CACACCCCCA TGTTTTTACT GGTCACAGTT 
21901 ACTGCCACCA TCCAAGGCTG AAGAGACTTC CTTTGTGTTA GGGCTAAAAC 
21951 CTTAGTCATT GTATCTAAAT GTCTTCTGTA TTCCTTTCCT CAAAAGAAAA 
22001 AAGTACCCTC TTCTGCCAAC CCTCTCCCAT GCCAACTAAA CAAGCAAGCA 
22051 AGCAAACAAC AAAGAAAAGG TG AT ATT AC A GATGCTGCTC AGCCTATGAT 
22101 GGGGTTACAT CCTGATAAAC CCATCACAAG GGATGTAATT CCATTGCAAG 
22151 TTACAAATAC CATAAGTCAA AAATGTATTT ATTTCATATA ACCCACAGAA 
22201 CGTGATAGCT TAGCTTAGCC TACTTGATCA TGTTCAGAAG ACT TAT ATT C 
22251 GTCTACAAGT GGACAAAAAC ATATAAAACA AAGCCTATTT TAAAATAAGG 
22301 TGTTGAATAT C TC AT AT AAT TTATTGAATA TTGTACTGAA AGTGAAAAAT 
22351 AGAATGGTTT TCTGGATACT CAAAGTATAG TTTCTACTGA ATGCATATCA 
22401 CTTTTGCACC ATCATAAACT TCAAAAATTG TCGGTCGAAC CTTCCTGAGT 
22451 CAGGAATCCT GTCTGTACAG GGTATAAAGG AGGAAAGCAT CAGCTTTGGA 
22501 GGCAGGTGGA CCTGTGTTTG AACCCTGATT CTGCTAGAGC TTGACAATGC 
22551 ATATTCGTTT TCTATTGCAT AACTAATTAC TACAAACAAC ACATTTATTT 
22601 CTCAGTTTTC ATGAATCATG AGTCCAGGCA CAATTTAGCT GCAGTTAAGG 
22651 TGTTAGCTGG GGCTGCTGTC TTATCTGAAG CATGGGGGTG GGGGTGTGGA 
22701 TTCCAAGGTC AGGTGGTTGT TGGCAAAATT AATTTTCTTG CAGCTATAGA 
22751 ACTCATGGCT TGCTTCTTCA AGGACACGGG GAGAGAGAAT CTCTCACATC 
22801 TTTTAAAGGG TTCACCTGAT TAGGTCAGGT CCACTCAGGA CAGTTTCCCT 
22851 TAAAGTCAAG GCTTAATAGT CAACTGATTA GGGACCCTAA TTATATCTGC 
22901 AAAATACCTT CACCATTGCC ATGTAACATA ATCATGGCAA ATAATCACAG 
22951 GTCCCAAATG TTCACAGGTC CCACTCACAC TTGAGGGAGG GG ATT AT ATA 
23001 GGGCATGTTC TTGCGGAGAG AAGGAATCTT ACAGCCACAT TGGAATCTGT 
23051 CTTCCATGCT ATTTGACCTC AGGCAAATTG ACTAATCTCT TGAAGGTTCA 
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23101 ATTTCCTTAC CTGGAATAAA AGGACAATAA GATCAGCCAT ATAAGGCTAT 
23151 GACAAAGACT AAAT GAG AT A GAATAGGCTG GAAAAGTCTT GCAGATAGCA 
23201 GACACAAGTA TATAACAATT TCCCTCCTAC TGTTCCTTTT GTTTTTCACC 
23251 TATCCTGCAG TCTCTGTCAC TTCAAATACC ATAGAAAACC TTTCCAAGCA 
23301 GCCCAAATCA TGCCCCCAAA TAGTCACGTC TCATTATTCA TAGCAGTTAT 
23351 GTTCCATAAA GTTAGCACAA ACTCCGAATG AGTGAATCCT AAAGCGTTGC 
23401 TCCTGGAGGA AATACAGGCT GCTGGTCACA ATATTTTTAT CAACTGATCA 
23451 AT AT AT AC CT TGTCTTATGT GTGTTTCTGC TTCAAGACAC TTTATTTAAT 
23501 ATATACGTTG ATTCATTAAC TCTGAACTCT CTAGGCAACA GCATTATAAC 
23551 TCCTGCCTTC ACAAAGCTTA TCTAACACAC ACATTTCCTC CTCAGGCACA 
23601 TCCCAGCCTT CTTGCACTTA GGATTCAGCA GTATGCTTAA GGGCCATTTT 
23651 CAACAGCAAA CTCATCAGCG C AAAC AC AAA CATGTGAAAA ACGTAGCACT 
23701 AAAGAGACTG C AAAAAGG AC ACTGGCTTAC AGCATGGAAG CTGGAAGGAG 
23751 AAGGCAGAGA ATCACCTTGT TCCACTTCAG CTATGAATAT GCAGTCAGGC 
23801 CACCCAGTCA TTCAAATTTT ATAAATATAC TCTAATATAT ATATAAATAC 
23851 CAGGCAGGGT TATTTTTTTC CTCAAGTCAT TTTTCTAATT TTTTTTAAAT 
23901 GAATAGATAG AAGAGCTGAA GT AAGGGTC A GGAGCAAGAG CTCTGCTTCC 
23951 TTTTCCCTTG CTGGGCTTCG TTAGAGAGCC ATCATCTCCT CAATATGTCT 
24001 CCCAACTCTT CTAGGCATTG GATGAGTTTG CTGCAGATAC GAAACCCAAC 
24051 TTTGCCAGTC ACTTCATACT AACAGGTGAA ATGTAGTGGA GGAGCCTTTT 
24101 GAAGACAGGG ACTCAGCCCC CCATTAGCCT CATTGCAGAC CTAGATTCCT 
24151 GCCAAAATTA ATTTGGCTGG AACTTCCCAG CCATGGCATT GTCGACATTA 
24201 CACATCTTCC ACTGTAATGT CAATTACCAT TTTATTCAGC CGAATGCTGG 
24251 AGAGTTAATG TTCAAGTGGT TAGAGCTGGC T AC GGGTGGG CTGAACAAGA 
24301 TGTCTTTTCC TTCATTTCCC CTGCCTGTGG TGAAGGATTG TAACCAGCCC 
24 351 TGGCTGGCAG CACTTTGAAG CTCACCCAGA GTGCTCCTGG GGACATCTTC 
24 401 TACAGAGCCT ATCATTTGGA CATGCTGTCT TCTGGGCCTG TCTTCCTTCC 
24 4 51 TTCCTTCTTC CCTCCCTCCC TCCCTCTTTT CCTTCCTTCC TTCCTTCCTT 
24501 CTTTCCTTCC ATCTGCTTTA AAACCAGCTG CCTTGAGTGC TTGTCTTGGC 
24 551 GCCCCTCATT AGTGCCATTG CAATCATCCC TCCTGCCTAC CCTGCTAACC 
24601 ACAGCTTGTT AGTCCACAAC AGCAACAGCT GTGTGCTGGG GTGCAGCAGC 
24 651 TGGAGGGCCA AAGGT AGGGC TGGGGGACAG GGTGTTGGGA TGGTTTTCTG 
24701 GGGCAGATGA GTTTATACGT TTCTTTCATG TCCCCTTCCT CCCACATAGA 
24751 CTTTTATTTC CCCAAAGGAA AACAGAAAAC AATGATCTGT TTGACAGTGT 
24801 TGCTATCATT GGGCATCAAA CCTATCATCT AAGGGGAATC CCCCTGTATA 
24851 ATCAGTCAGC CAAATGGAGC AGGACCCTGT GTTTTGTAGC T GAT AC AAC A 
24 901 GGGCAGCATC TCTAGTGAGG GGGCC AGGGC TTCTATTTCC TTCATTAAAA 
24951 AATGAAACAG CAGACCTGAT TCCATATTTA GAGATTACAC TTAGTTGCCA 
25001 CTGTGGGTGT GCAGGCACCA ACCAAACCCA GTTGGCACCG TTGTCTTTTC 
25051 TCTGCAATGA TGTATTGAAT TTAATAATGG AGGTATATGA AATTCAGAGT 
25101 GATTGGAACT GAAGGTTTAG GGGCTTTGTG TAAAATTGAT ATGTAAGGGA 
25151 TTTGGAAGTA GGTGAGGGAT TCTTCCCCAA TACTTATTCA ATTTTGGAGT 
25201 CAAATAACCA AGCATTTACA AAT AGC C AAA AAAGAAATTG AAAGAGGGTT 
25251 TAATCCAATA AATTTTCATG CCTCATATGA ACCACATCTT ATAATAAGAA 
25301 TTATGCTTTT TCATTTCATA CTCAGTTAAC AAATATGATT TGTGAGCACC 
25351 TGGTAAGTTC AGGGCACTAG GCTGAAAGGG GTTACCAAAT GTCTTCATTT 
25401 AACAAAGTCC AGCTGAGCTC TTACAGGTAC CAGAACTGTG CCTGGGCTGT 
25451 CATATGAAGA TGAATGTAAG AGTGTGTCAG GCCTTCAAGA GCTTACAGTG 
25501 TGTCAGGAGA CATC A AAC AA GTGAGCCAAT AAAATGATAC TGCCATTTTA 
25551 GAAATAGCCT GAAATTCATG GAGTTCACAG TCTTGTTAGG AAAGTGAAAC 
25601 ATAAACCTAT AAGCATTAAA AAATAACTGT TGAAGACAGT AACGGAAGAA v 
25651 TGCAACTGGC AAC T G AAT G A TATAGGTTGT GATGACTGTT AAATATCATG 
25701 AAAAGAGACC ATGATGAGCT GAGGCACTCC AAGAGACTTC TTTTTGGAGA 
25751 TATGTTTGGA GCCAAATCTT GAAGATTTAA TTGCTTTTTT CTTTTTTTTT 
25801 TTTTTAGGTG GAGTCTCGCT CTGTTGCCCA GGCTGGAAGT GCAGTGGCAT 
25851 GATCTCTGCT CATTGCAACC TCTGCCTCCA GGTTCAAGCG ATTCTCCTGC 
25901 CTCGGCCTCC TGAGTAGCTG GG ATT AC AGG CGTGTGCCAC CATACCCAGC 
25951 TGATTTTTGT ATTTCTAGTA GAGATGGGGT TTTGCCCTGT TGGCCAAGCT 
26001 GGTCTCAAAC TCCTGACCTC AAGTGATCTA CTCGCCTTGG CCTTCCAAAG 
26051 TGCTGGGATT AC AGGC AT G A GCACTGTGCC TGGCCTTTTT TTTTTTTTTT 
26101 TTAAAAAAAA AAAAAAAAAA AACAGGAAGT TTTCGTTAGT TTTTTTGTTT 
26151 GTTTTACTTC CCATAAAAAC TCTTTGTGTC ACATGGAGGT GAATGGAAAG 
26201 AGAGGCTGTG GCAACAGACG GGAGACTTTT CTGATATCAG AACCCAGTCC 
26251 CATAGACCAG AATGTATGCT TTCAATCCAC GTTGTCTGGG TCCATCCTAT 
26301 TGAGTGCCCT GCCCCCACAG CGGGGTATGG AGAAGAGTCA GACACAGCCC 
26351 CAGTCCTCAC GTAGCTCACA ATCCAGTGGA GGAGACGGAC T C AG AAAC AG 
26401 ATAGAGATGA AGCCATGAGA TCAGTACTGT CCGAGGCCAT GGCCACGGTT 
26451 TTGTGGGAAC CC AC GAG AGG GAATGACTAA CTGTGGGGAA GAAGAGGGAG 
26501 AGGACCAAAA TGCAGGGGAA GTGCTCACAG AGGATAAGTA AGCAGTGAGG 
26551 TGCCATGAAA TGAGTATACA CCTGACAGCC GTGTAACAGC TCAGAGCCTG 
26601 GGTAGAGGGG AAT AG AG C TG CTGGTTCTCT GGGGGGAAGA GAGGGGTATG 
26651 GGATTCTGGA ACAGAAGCAC CAAAACCAGC AGGTTATTGG AGCTGTTAGT 
26701 GCTCAGATCA GCAATGGGTG CACAACCAAA CCATTCTCCT AGGGATGAGT 
26751 TCTTTCCTGT GGATGAGGGC TTCTCAGCCT GGCTTCTCCC GAGAATTACC 
26801 CGGGAAGCTT GAAAAGTACT GATGCCTGGA ACCTACCTCC AGAGAGTTGG 
26851 ATTTCATTGT GTTGACGTGG GGCTGGGATA TCAGTATATT GTTTAAGCAC 
26901 TCCAGGTGAT TCTGATACGT AGCTGTGATT GAGAACCCTT GCCCTAAGCT 
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2 6951 ATCCATCTGC ACTCCAGGGG TGCTCCCAGG CCCATCTGTT TGTAAATGGA 
27001 CAGGTGTCTT GAGGTAACAA ATGTGCCAAG GCTCTGGAGC CAAGCACGCC 
27051 TGGCTCCTTA GTGCCTACTT AGTGACCTCA GGCAAGTTAC TAAATGGCTT 
27101 AAACTTTACA AATCCTTAAT TTGTAAAATG TGGGCAATGA TAGTACCTCC 
27151 TCACAGGATT ATTACGAGGT TTACACGGAA TACTCTCAGC TCATAATAAG 
27201 CACTTGCACA GGCCTCATGG GCTAGGCCCT CAAAACTTAA CGCATCTACA 
27251 GGCAACAGCC ATATGAAAGG AATTTTATAC CACCAAGTCA AAAAATCTGT 
27301 GAGCACTGCT CAGAAGCAAA AGCCTGTCTC CAACAGCGCT CATTTAAGGG 
27351 GTGGGCGAGC TACAGAGAGA AGAATGAGCC CCCACAGGGT AAGCTGGGGA 
27401 AAGCTGGGGA CAGAATGAGA CTCAGGAAAT CACTTGAATA TTGATTATAT 
27 4 51 TTGTGCTCAA TAATAAAATA ACGAAATGAG TACAGCCCTA GACCTAAACA 
27501 TTGTGGGTGA GGCAAAGGCA ATGCGTTAAT TTTGCATCCA CTGAGGAAAA 
27551 ACTCTAAAAC GGTGACTTCT TTTTTAAGGG ACCAGAAGAA TC TAG ATT AT 
27 601 ATTTAGTCTA AGTCAATACA TACGACAGAA CCTTGCCCTC TAGACTTGAT 
27651 AAGAAAGAAG TAAAATAAGA GAAAGAATAA AAAACCC TTC CACCAAAATA 
27701 CTAACATTCA GATAATGACT TTTTAGTTAG GTCTCCTGGA GAGGAGGTTC 
27751 CCTCAGAAAT GAATAGATTT CTCTTCTAGT GCAATCATCA AAAGGTAATG 
27801 CATGGACTTA AGTGTGATCC CCAAGAGAAA ATCAATGACC TTTCTGTGTT 
27851 TGCCTTTGAG AAAATCAGCC AGTCTATGGT TAAATTAGAC ATATTTTTTC 
27901 TCCTTGGTCA AGATTAGTGG GACCAAGAAT GCAGTCTTAC ACTCCTTCTA 
27951 GCAAAGAATT ACCTGATGCC TTATTTC ACA CAAATTTGCA AAGTTGTATG 
28001 GACGTTGTAT CTTATTTTAA GGAGAACTGG TGATC AAA TG ATGACTATTT 
28051 CAATAGTGGT TCATTTACAC CACCACCCTC ACCCCACATC CTGCTTTCAC 
28101 CTGAATCTGA ACGATCATAG TCAGTCTGAG ATTCTGAAGG TTTGAAATTC 
28151 CTTTTCTGAG CTCTGCAAGA ACAGCATCTC CCAAGAGAGC TCAGGGCAGA 

28201 CTGTCTGGGA GAGATTGGAA ACCTGTCTTT TGCAGTAACA TGAATTGGTT y 

28251 GAATGGTCAC CCTCCATATC AGGCCTGCTT CTCCCATTGG GTTTCTGATC 

28301 AGCCCAACTT GGGTCTCACC CTTCTGATTT CTCTCTCCTG GCTCACATGG 

28351 GGCTGCACTG GCCATTAGGT GCCAGGCTTG GCTCCGTGGA ACCCATTGGC 

28401 CAGCTGGGCT CTGTGGAGCC CTAAGGCAGG GCTCTGGTCA CTGGTGAGAG 

28451 GGAGGCCATT GGAGTCACTG GGGTGGACCT ACAGACCCTA GGGTTAACAG 

28501 CTAGGTGGGT GTCCTCTTCA GAGAAACGGG TTACAAAGTG AAAGAAAGTT 

28551 ACACTGTGAG GTCAGCCAGG GAGGAAGACA GAGAGCTGAT ATAAGATAGG 

28601 TACTGATTCC CTGGGGATGT GAAAGGAGGG TAATATTCCT AAAATGATAG 

28651 CATTTAGCTT CCAGTATACA TTAATTGATT CCTGATATTC ATTAAAACTA 

28701 AACGCTATTT CCTTGATGTC TCATCCAAAG CCGCACCACT CTTCCCACTA 

28751 AGTCTGAGGG GAGCTTGTTT TGTTGACAAG TGTAAGAGGT TGAAGAGGGA 

28801 CCCATGAACT CTTTTGTCCT ACTGAAGAGA TCCACAGATG GAAACAAATG 

28851 CTCCTACCAC ATTTATGAAC TGCTGCTTTG CAGTCCCGCT TCTGCTATCA 

28901 TGCACAGGAA CTGACTAAGC TCCAAAGCCA GAGGATGTAA ATCTCCCTGT 

28951 AATAAATGTA AGTCATTTAT TAGCTACATA CACTTCAGCA AGTCACCTAA 

29001 CCTGCAAATT TCAAGCATGT GAATCTTGGA TCTTTCATGT GCTAGCTGTG 

29051 AGACTTTGAG AAATGTATTT AATGTCTCTT TGCTTCCTTT TCTACCCACA 

29101 CAATGGGTAT AATAATGTCT AC C AT AT AT C TTTGCAGCAA GGTCTAAATG 

29151 GGGTGATACA TGCTGAATAC ATTTCCAACA GAGTCTGTGC AATGATAAGC 

29201 TCTTTCCAAA TGTTAGTTAA AGCTAACCAA CTAACCCACC AACAAACCAA 

29251 CCTCTTAGCC AGGACTGATG GAAGGAGTCT GTGAGAGAAT GCATTTAAAA 

29301 CACTTGGCAC CATGCCTGAC AAGAGTAAGT ACT CG AT AAA TCAGTTATTG 

29351 TTATTATCGC ATCGGTATTA TGACCATTAT CCTCTTCTCT ATAGGCTTCA 

29401 GGTTTTCCTG TCTTTTTATC ACAGCAGTAT TCCAGCAGAA GCCTTTGATT 

294 51 TAACTAAGTC TCTACTGTGT GTGTGGCTAG ATGCTATAAA GCATCCAGAG 

29501 AAGTGAGAAT TTGGTCCTGC TTTTAAGTAG CTTATAGTCT AATTAGGGGG 

29551 AAGTAATCAG AT AG AAAGG A AACTAACAAT ATGCAAAAGG AAACTCATAG 

29601 TTTGTGGTAA ATGCCAGGTG CTGCTGATAG TGGCTTCAGA GAGATCTCAT 

29651 AGATGCTATA GGAGGTCAAA GGAGAAGCGT GCAGCTTGAG CTAAGTTTTC 

29701 AGGGAAAAGG GTGAAAGAAT TAGTCATTAA TGTACACCTA CATTACCTGC 

29751 CAGACTCCAT TCAAAAATAT TCTTACCAAA TCATCACAAT ACCTTGTTGG 

29801 TAGGTACTAT TACTATTTTA CAGAGGAGGA AAGTGAGGCA AAGACACATT 

29851 AAATAATTTT CCCAGAATCC CAAGGTGTGA GGTGGAGCAA GGACACAAAT 

29901 CCATGGCTCT AAGTCCCTCC TAGTATATCC TGCAAACACA TCTGGAATTA 

29951 ATGCAGAGAG GAAGGGGAGA GGCAGTGTTC TGCAGGAGTT CAGAGCCATG 

30001 ATAACCCTTC TTGTGTGGCT TTTGGTAAGT TATTTTACCT CTTACCCTCT 

30051 GTTTCCCCAT CTGTTCAATG AAGGTTGTAT AT AC AC AC AT TATATGGCCG 

30101 CTGTAAGTGT GCAGTGATAT GATGCATGGG GACTCAGTTC ATGAGGCAGT 

30151 GTGAATTCTG AAGGT AT C AC AATGGGACAG GTGTTTTTTT CTCCACTCAT 

30201 TTTCTCCGAA AGTCTTTTGT TTTGTTGCCC TCCCTCTTTG GGGCATATGC 

30251 TTTCAGCTCA TACCTTAATG ACATCAGAAT CTGCAATTTC CTGGCAACTT 

30301 TTGTGGTTAA AATTATTCTG CCCTTCCATT TTAAAGCACT AATAGCAAAG ■ » 

30351 GTATTAGGTG C AAAATG AT G ATAAAAATAA TTGCAATTTT TACCATTAAA 

30401 AGTCATGGCA AAACCACAAT TACTTTGGCA CCAGCTGAAT ATTTTGAAAC 

30451 TCCCTACTCT GATGTTAACC AAGTTCATGA TTCAAAGAAC TTGCAGAGGG 

30501 GTAGGGGAAT TTCAAGGGAA AGGGGGAGAT GCCTGGGGTT GTCACACACT 

30551 CTGTCTTTCA TCCTCTATTG ACATGTTGGT TATTTGGAGA TGGTATTCAG 

30601 TTCCACTATA GCCCCTCAGT CACTGTAGAC CCTCTCAAAG GGGCAATCAT 

30651 GTTTCCCTTA GGTCAGGTCC ATTCATCTAA CCCCTCTCCC GGGGGCATCA 

30701 CCTTGTTTGT TCCAGCAGCT GTCTGGCCAA ACTCACACCT CCTCCTCACC 

30751 CTCTAGCCCT TATGATCTGC TTTGGGGAGC CATGGGAACC CCTAGTTTCC 
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30801 TCTTTCATAC CCACTGAGAT TCACAAGTAA CTAAGGTCAA GGCGGGGCTT 

30851 CATTGCCTTT CTGCAGATAC CTTACGCTAC TGTTCCTCCT CGCCTGGCTG 

30901 GCTCCACACT CCAGCAGACC TTCTGCTGGG CGAGAAGCTG CAGGCCTGAA 

30951 TCTCTGTGTT CTCATATGGC CCCAACTCTT GGGATTACAC TAGCTCTTGT 

31001 AAGAACTCAA TGCTCTGCTC TGCTCATTTT GATGCCATCA AAGAGGGCTT 

31051 GCAAGTTACC AGCTGGGAGT GAACACCAGT GTCCTCTTTT TAGAGGTACC 

31101 CCTAATCTTT CTGAACAATT TTGCTGGCAC CCCTTCACTT GGCTTTGCCG 

31151 GGGTAAGAGG GGGCACTTCT CTCCTTTCCC TCATGAAAGG AGGGAGAGAA 

31201 GCCAAAAATC TCCCTACTAG TCAACAACTC AGGCACCCCT CCTTCTCTCC 

31251 TCTATTTTAT AGACTGGGAA GGGAGTGATG GTTGTTGGAG GTGGCAGAGC 

31301 CAGTTCAGCT GCCTTTTGTG AAGTCCTGAA GGAGGTGTCT ATCCTCAACT 

31351 GCTGGCTTCT GTCCTTAAGC CTGGGGAGAA TTAAGTCCTC TTTGCCTCAG 

31401 TTTGGCACTC CAATTGCCAA CATTGGGACA GCAGGAAAAG TTCCATCCAA 

31451 CATCCCATTA AATATGTAAT GTGTATTAGC ACAGCGCCTG GCACTGGGCA 

31501 GGTATTTTCT^ AAGTGATAGC CAATGCGAAG CCTACTTTAT TATTTTCCTC 

31551 TTTGCTTAAC CTACAAGGTG TCTAAGACCA TTTGTTTGTC CACACATAGT 

31601 AAGATAAACA GCACTGAGAC TGTGGTCCTT TCTGCCCTGT GTCCTTATCC 

31651 CACCTGGGAA TCTGGAAAGC C AAGCCT AG A CACACTCGTT CCACAAATGT 

31701 TTACTGAAGC TTGTTCTATT CAAAGCACTG TACAGCTACA AAGACCATCT 

31751 TTTCTGAACT CCAAACCAGG CCACATGGTT GGAATAACTT CAAGTATGGA 

31801 GACCAAGAGA AAACGTGGTT GTTGTCAGCA AAGCTCTGAG TCCACACCTT 

31851 ccaggaactt atagttgatg caatggtggg ag aagt ctg a acctggattc 
31901 aatctgcttg attccgatga atgctgcagt aggcagagcc atgagttcag 
31951 agcaggaaga aaccactggt tcaaagaagc atctgtcaca tcgaagctgc 
32001 tttatagtct ctt sgcaagc atgcataata atttattctt tctttctttc 
32051 ctttggtcaa caaarj» ttt z ttc.a 3tccct actatgtgcc aggtactctt 
32101 ctaggtactg aa g a t'v. agc agt saac aaa gaagatacaa tccctgccca 
32151 gcggagctta c a ttt t aott at ^aaagtc cctttctcag tggctgctct 
32201 ctttatttga gaaac jato : r.«.*t\:tt:*tcc tcccatccta gggctgctgg 
32251 ctccacagag g:a:a;ast: :atca:^atg ctctgccagc cacccaccca 
32301 ctcaagacca a'w'tt a " c. " t.*t a3t3tg agcagggaca ctcccgtctc 
32351 tgctacctcc tttct ."ctv. aaa :aa >atc tcagggaaca tctgccatcc 
32401 attttccctc cct^'^uact ua "av.aaag gtgtatggag gagattgagc 

32451 GGAGTGATGG ATTGA V» "A" T^T-^AA^TG AATCATTGCC TGACATGGGA 
32501 ATGAGGAGAC TT;;C77AAAo G A ." AA .V7 CAT GCTAAGTCAT CCATCGTTCT 
32551 CCCCTAAGGA GGTGAA77JA ASTT'.; CCATT TTTCCCAGGG AGCCAAATTA 
32601 ACAAGGTGCT GGG A *VA TTTC CAAA7TAGAA AAAAAAAAAA AAAAAGGCAC 
32651 CACCAGCTCT C AAA 7 'J A SAG Ar.'iCTGTTGA GTTGTTTTTT GGAGCAGATC 
32701 ATTGTATTTG GCATCTAACC 77CAAATAGA GGAGAAAGCA TGGAATTTCT 
32751 GCTGAAAACT CATCCTTCTC 7C.A "pCAGGTG GTACAAATAA GCATCGTTGT 
32801 GTTCTCAGAG GCAG/iAACCA CATTTGCACC TTGATACCAA CTACCTCAAT 
328 51 AACCACAGTG CTGAATTTTC ACAAATTGCG AATTAGGAAA TTGTTGCTCA 
32901 TTTTACAATT TGGTTTCCCT CAGGATTCCT TTT AAGT AGC CAGCTACCCC 
32951 AGTACTTTTG AAATATGACT TGCTTATAAA AATTTGATAG GCTTGGCACG 
33001 GTGGCTCACA CCTGTAATCC CAGCACTTTG GGAGGCCGAT GT GGGGTGG A 
33051 TCACGAGGTC AGGAGTTCAA GACCAACATG GTGAAACCCT GTCCCTACTA 
33101 AAAATACAAA AACTAGCC AG GCATGGTGGC ACATGCCTGT AATTCCAGCT 
33151 GCTCGGGAGG CCAGGCAGCT AGGCAGGAGA ATCACTTGAA C C C AGG AG AT 
33201 GGAGGTTGCA GTGAGCCAAG ATCATGCCAC TGCACTCCAT CCTGGGTGAC 
33251 AGAGCAAGAC TTCATCTCAA AAAAAAAAAA AAGATATATA AACAAGTTTT 
33301 TATAATATTC TCAATATGAA CTAGTAGAAA AAAAGCATGT GTTTTTAGGT 
33351 CTTAGAGGCC TGGTTCCCAG TTTTATCTCT GACTCTAATG AGGTATAGTA 
33401 TTACCTACAT TGATTAGCCC TTCTATACTT CATAGGAGAT GCTCCAAGAC 
334 51 TGCTAGCTTT CTTCATTCAA TAAAGAGAGA TATAACAGGA TGGGCCTTAA 
33501 AAGTAGCATG CATTTCTTCT TTCATTCACT CATTCAAAAT ATTTTCATGC 
33551 GTGAAAATGC CAAGGATGTT TGGTCAACCA ACTCTTCCCA GACCCTGGCT 
33601 GTGAGCCTGG CTTAGAACAA TTCCATTTTA ATGGTCCATG CCCTCAGGCA 
33651 CTTGTATTCT AGTAGAAGAG CAAGGTAAGA AAACAGCTTA AAAAGTTAAA 
33701 CAGTTTTAGG TTGAGATGGG TGTTGTGAGA AAAAT AAGC A GGATGCTTTG 
33751 AACCTATGCA GGTAGGAAGG TCTGGAAAGG CCTCTCTGAT ATGGTGATGG 
33801 TTAAAGCAAA ACCAAAAAGA CCAACAACAC ATGGAACACA TGAAGGGCTG 
33851 GAAGAACAGT GTTTTATGGG GAAGGACTAG TACACACAAA GGCTGCAAAG 
33901 GCGAGTGGGC TCATTATGTT CTAGAACATG CCAAAAAGCG GGTGCAGCTG 
33951 GAGAGGGAGT AAGATGGCAC AAAAGGTGAG TGAGGTGGAC AGGAGC CTT A 
34001 TCACGCAGGC TTACACAGGC TCTCAGAAGC CCTGCGTGTT GGTTTCTTGG 
34 051 GACTACCGTA ACAAAGCTCC ACATACTGGG TGGCGTAAAA CAACAAAAAT 
34101 GTATTGCCTC ACAGTTCTGG AGGCCAGAAT TCCAAAATCG GGTGCTGGCA 
34151 GGGCTGCGCT CCCTCCAAAA CCTGTAGAGG AGAATCCTTC CTTGCCTGTC 
34 201 CCTAGCTTCC AGTGGGTTGC TAGCAATCCT GGGCTGGGTG ACTCCAGCTC 
34251 TGCCTTGGTT GTCACAGGGC GTTGTCTTTG TGTGTCTCTG ACTTCACATA 
34 301 GCCCTCTTCT TCTTCTTTTT GTGTGTGTCT GTGTGTGTCC ACTCTGAGGC 
34351 ACAGAAGTTT TTATTTATTT ATTTATTCAT TTATTTATTT CATTGATAAA 
34 401 CATAATAGTT ATGCATAGTT TTGGGGTACA TGAGATATTG GATACATGTG 
34 4 51 TACAGTGTGT GATAATCAAA TCAGGGTGAT TGGAATATCC ATTCACCTCC 
34 501 AAACATTTTC TCATTTCTTT GATTGGGGAC ATTATAATTC TTCTAGCTAT 
34551 TTTGAAATAT ACAATAGATT ATTGTTTACT ATAATTTCCC TGCTGTACTA 
34 601 TCGAATACTA GAACTTATTC CTTCTGTTGA GGGTGTACTT TTGCACCCAT 



FIGURE 3, page 9 of 57 



TCCID: <WO 0233088A2_I_> 



WO 02/33086 



PCT/US01/32152 



34 651 TAACCAACTT TTCTTTATGT CCTCCTTCCC ACTTCCCTTA CCAGCCTCTG 
34 701 GTAACCACCA ATCTACTCTC TACCACCATG AAATCAACTT TTT TTTTTT A 
347 51 TAGCTCTCAT ATATGAGTGA GACTATGCAG TGTTTGTCTT CTGTGCCTGG 
34801 CTTATTTCAC TCAACATAAT GACCTCCAGT TCTGTCCATG CTGCTGCAAA 
34851 TGACAGGATC TTATTTATTT TTTTATGGCT AAATGGTATT CCATTTTGTA 
34901 TGTATATCAT ATCTTCTTTA TCCATTCATC CACTGATGCA TATTTAGGTT 

34 951 GATTCCATAT CTTGGCTATT GTGAATAGTG CTCCAATAAC CATGGAAGTG 
35001 AAAATATCTC TTCAACATAC TGATTTCCTT TCTTTTGGAT ATATACCCAG 
35051 TGGTAGGATT GCTAGATCAT ATGGCAGTTC T AACTTT AG A TTTTAAAGGA 
35101 ACCTCCATAC TTTTTTTCCA TGGTGGCTGT ATTACTTACA TTCCCACCAA 
3 5151 CAGCATATGG TCATCTCCTT TCTCCACATC CTTGCCAGAA TTTGTTATAT 
35201 TTTGTCTTTT TGATAATAGC CATTCTGACT GGGGTAAGAT GATATATCAC 
35251 TGTAGTTTTG ATTTGCATTT CCCTTATAAT TAGTGATGTT GAGCATTTTT 
35301 TTATATACCT GTTGGCCATT TATATGTCTT CTTTTGAGAA ATGTCTATTC 
35351 AGGTCTTCTG CCCATTTTTA AGTGGATTAT TTGTTTTTTT GCTACTGAGT 
354 01 TCTTCGAGTT TCTTATATAT TC TG AT AC AC AGCCATCTTC TTATGAGGAC 
354 51 TCCAGTTATA TACGATTAGA GAGGTCCACC CTTTTTCAGA ATGAAATTAT 
35501 AGCTTAACTA ATTACATCTG TAGTAACTCT ATTTCCAAGT AAGGTCACAT 
35551 TCTGAGGTAC AAGGGT TT AG GACTTCAACA TATGAATTCC AGTGGGACAC 

35 601 AGCTCAACAC ATGACACCAT GGTAGGGAAC TTTATTCTAC TTGCAAGTTC 
35651 TGAGTGTCTT ACGCAGGTAG ATGGACTGGT GTGATGTATG CTTTAAAGAC 
35701 CGCTGTGTGA AGATGGCCTT AGGGTGATGA GGATGGAAGT TGGAGACTAA 
35751 T A AAGG AC T A AGAAAATGCT AAGAAAATCC AGGTGAGAGG TGATGATGGC 
35801 AGAACTAAGG TGATAGCAGT AGAGAGAAGA GAAGTGGATG GAG ATT AG AC 
35851 ATCTTTTGCA GAACGAATGA CAAAATACCC CTATGGATTG GACATGGGAT 
35901 GAGGAAAAGG AAGGACTTGA GGGTGGTGTC TAGGCTTTTT ACTTTAATCG 
35951 TGAAGGGAAG CTGGTGCCAT TTACCTTGTT CGGACAAACC TGGAGAGGAT 
36001 CAGGTTAGGG ACTGCGAGTG GTATGGACGG CAAAGGAATG GGAAGAATGC 
36051 AGGGATTAAA AATTGGAAAT CCCCCTCCCC AGTCAACAAT ATCTTACTTT 
36101 TATCTGAAAA ATACTAAGTA AAAAAGC AT C CTTTTGTTGG AAAGCTCAAT 
36151 CCTTGTTAAA ATGAAGACAT CTCTGGGAGA GGAAACATAG TGAGCACCTT 
36201 TCCCAAAAGC AGCCACTGAT TTGGAGATGA GACAGAGTAG CATACAGGAC 
36251 ATCAGAGAGA ACATGCTCAG GACAGAAAGA GCAATGTAGG ACAAGGCAGT 
36301 GTCTTGGCAT CACAGTCTTT CCTCCGACTG GCTGTGAGCA AGTGCTCAAT 
36351 TTAATTCCAT CTCAGTGCTG GGTCAGGACA AGTGCCCAAA AGCAAATTGA 
36401 CAAAAGTACC AGCATGATGG AGTTAGAAGG TAGCAAGTTC CCTCCACAGA 
36451 GCCCAGCTGG AAAGGAAGAT AGAGGGGAAG TTGACCCCTG GGGATGGGGA 
36501 ATAGGGTGAG AGGAGAACAT GAAACTGAGA AAAGGGCTTT GAGTGAAATC 
36551 TAGGCTAAAA GCTAAGGTTT CTTTAGAAAC CCACCATTGA CCCAACATGA 
36601 CCAGGGCTTT CTCTTGACTT GATTATTTTT GATACCCCAT CTTCTTCTGT 
36651 ATTCCTGGAA CTAGCTCTCC CAAGCCCCAG AATTGTGCTT CTATCAGAGC 
36701 TGGGTTTTCA TCAGAGTCTC CCCTTTATCC TGTATCTCTG TTGCCCTATT 
36751 TTGTTTGAAT TCCTGCCAGG TCAGCTGAAT TTGGGCATTT GGGGTGAAAA 
36801 ACCATCAAGT GTGGCATCCT GGCTTTGGCA CCTGGCACAG TGTGACCCCA 
36851 CTGGTCTCTC CCTCACATTT GCTGTGGTCC GTGCACGGAA TTTGTCAAAA 
36901 GACCTCCTCA GTATCAGCTT TCCTGCAGCC TCAATGCACC TTGTTCTGAA 
36951 TAGGATATTA CCCCCCAAGA GTATATTAGG GCATTTTCCT ATGCCAGAAG 
37001 GGGTCCTTAG GCCTCTTGCA GTTTTTTCTG GGTG AC AGTG AAGGAGGAGG 
37051 TGGCTGCAGA GCTTACTGCC TGTGGACTGA CCACCCCAGG GCCTGGTGTC 
37101 AGGACCATTT GTCCAGCCTG TTGAGTGAAG GTCATTCTGC CTAAACTGTA 
37151 AGCACAAGAG AGAGTTCAGC ATCATTTGCA TCCTATTTTA TTGTCTTTCT 
37201 TCTCTTTTCT TTCAAGGCCT CATTTTTTTT GGCTTGAACA AATGGTAAAG 
37251 GCCATTTTAT TACAGGTACC AAGCCAAACT TTCCTTGGTT TTGTGGCCAT 
37301 CCTGCTGGGG AAGGAAGTAC TCCTTTACTT TAAATAACTT TAAAAACATC 
37351 TGTTTGGTCT CAGGGGCTGC AGCTGGAAAG ATTTTCTAAC TAATACTTGT 
37 4 01 TTTATGGGGG TGTTTTTGGG GGGGTTTATT GAGTGTCAAA CCTGGCAGTA 
37 4 51 AATTAGAATC AGAAGACAAC AGTTAGTGAT AAGCAGAGAA GCCAAGGATG 
37501 TTACCATAGG CAGGCAGCAG AGAGAGGGGA ATTGGTGGCT GGCCCCCCAA 
37551 AAAC AG AT TT GAAGATCTCC TTCTGTCATG TAGTGAATCC CCAAGTGCCT 
37601 AGGGTGGGCT GTGATTACTT GAGCTCCTGT CTCCACTGTC TCAGCTCACT. 
37651 TGCCTTGGGG TGGACACACA ACACACATTT GCTCATAGCA TCAGGTATTC 
37701 AGGAGCAAAG AGCTGAATTT ATCTGGTTAA TTTAGATACC CCTACCCCCT 
37751 CTTTTAACAC CAGATTGCCA GGATCATGAC CTCAAAAGGC TACCCTGAAA 
37801 TGCAATTGAC AAATGGGATG AAAGATTTCC CGTTTCATCC ACATTTGCCT 
37851 CCTGAGCTAC TTACAGCAGC AGGTCACCGC AGCCAGAGCC CACCTGCTTG 
37901 CCCACCATGC CCGCACACAG ACAATGCTGC TTCTGTGGCT GGAGGTCGGA 
37951 ACACCTCAGC ACTATCTCAG TTTGGCTGCA GATCCTCTGT GTGCTTGGTA 
38001 AACAGGTTTC CTCATCTGTA AAATGAATTG GCTCTTCCAC AACTTTTTTA 
38051 AAAGCACTAA CATATTAGGA CTCTCACTAA ATACTCAAAT GCTAAACTCA 
38101 AATACTAAAA GAGTGCAAAG GGATGGGCTC CCAAATATTA CAGTGAAGGC 
38151 TGCAGCATTT TCTGACCTTG CTGCTTTTTC TGGTGAGTGG CTTTTATTTC 
38201 TTAGTTTGGT TTCTTCTCTC CCATTCTAAT CAAGCAAGAA GTGACCACCA 
38251 AAAGGGGCAC TCACCAAACC AGAACAAGCT AGTTCTTTCA TCTTTAATTC 
38301 ATTGC AACC A AACAGATGCC ACAGAAAGAG CCAAGGGCTC CAGGCTTTAG 
38351 CTCCAGCCTT GCCATTAACT ACATATGTAA GTCAGCCATG CTGGTCTGCA 
384 01 GGTTCTTGCT TTGCATGATC AAGGGACAAC TTGGAAGGTC TCCAATCACT 
384 51 CTATTCCCCC AGATGGAAAT GTATTCACTT ATTTCCTGGA GATGTCTGTC 
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38501 CTCCTCCCAG TTAAAGACAG ACCTTGACCC ACCTCCACTT CCTTCTCTGT 
38551 GGCCCTGTCT TATCTGTCCT CTTGTTCTTG CCTCTTCAAT TGTTCTCTCA 
38601 CCGTGTTTGT CACTTCTGAG CTATCACTGT GATCCCCCTG ATTGTTTTTC 
38651 TAATGTCCCT GAACTTCAAC CTGATTTTCA CGCATATACA ATGTCTTCCT 
38701 AAACACTTAT AGACTCTGAC ACATTCTGTA ACTGACACAT TTCCCTTTAT 
38751 CAAATGCAAT CTAAGAAGCT CACAGTTTCT CTCAGTTTCA ACAAGAGAAA 
38801 TCAGGAGCAC TTGAATTATA CAACTTGACA TTATTAGGGC TGATGTCTGA 
38851 TTTTGTCCTG TCTGCCCCTG TCATTTCTGT ACTACCTTTT ACAAAACCTC 
38901 TCCTATGACC TGTGTCCTCC TCCAGCTCCA TTTGAGAACA CCTGCTGTAT 
38951 ACCCTGTGGG CTAGCTTTTA TTATGTTCGC CTCAATGATG AAGAAACAGG 
39001 CTTGGAAGTT AAATTATCTA CCCCAGGCCC ACAGCCTGGA AC CTAGGATT 
39051 CCAACCAAAC CTTGTCTGAT TCTAAAGCAT AGCAGAGGCT CCATACTCTG 
39101 CCTCCCTCTT CTACATCATT TCAGTTTCTT CACTTTCCCA CCTCCAATTC 
39151 TCACCCAAAC TGAATGTCTC ACAGTCTCTG TGCCCCCACT TTGCTCCATC 
39201 CCTTGGCCTT CTGCAGTCCA AGCTCCATTC TGAGATCATC CAAGGCTTCT 
39251 CTTCTGTGTT GATCCTTGGC CTTCTTGGAG TCTCTTTCTC CCATGTTCTC 
39301 CACAACAGAG CATTCTCCTG ACTGTTTTCA TTCTGCATCT CACTCTTTCA 
39351 TCAGTATCTT TTTCTCTACC ATGCC CC AT A AATTTGGGTG CTCCTGAGGG 
39401 TCCTGTCCTT GTCCCCTGCT TTCTTGTTGT ACAACCTCCT TGATCTACTT 
39451 CATCTACTCA AGTTTGGTCC ACAATTTCTA TATTGTGAAG ATTCAAATCT 
39501 GCATCTCTAG CCATATATCC ATTTGCCTGC TAGGCATTTC TACCTGAATA 
39551 TTTTATAGGC ATGCCAGTGG CTCTTACTCT ATGGCTCTTA CTCTAAGTCT 
39601 AGACTACAGC AGAAAGCAAT GCTCTTTTTA TTAAGGCATA GTGCCTCTTT 
39651 CAGAATAATT TACAGCATAC AACCAGGCCT GCTGTGCAGC ATTACAATTT 
39701 GTCATTAAAA CTCCATTCCT CTTGCCAGAG TAAATGAGCC ATTTACAGCC 
39751 AGGGCGCCAA GATGGACTGT TGTTATTTTT TCTGCCTTTG TATTATGAGT 
39801 ATTCATGGCT CTCCTCAGAC AAGCTCCTGG GGATTCCCAG TGGAGTTGCC 
39851 TTAACATGCA GGTCAATTAG CCAGGCTCAA GGGTAGTTTC CTGGATATTG 
39901 GTATCCCCCT TGCAGAGGAC TGCAGGAAAG CTGAACAGTG TTCCCCCAAT 
39951 GTGGGTGGTG ATCCTGAGAA ATATCATTTG TATCTGCATG TGCTGTCTCA 
40001 C AC AC ACT AG CTCACATGTG CACACACACG TGCATGCACA GGACAAAACC 
4 0051 AAACACAGGG CAACCCAGCA TCTGCCCCCC AGCCATCAGC ATTGTTACAC 
40101 CTTTATAGGG GGCGGGAACA GGTTGGTCAG CAGGTGAACG TCAGGTGAGT 
4 0151 TGAGAAAAGT TATTAAATCT TAAATCCTTA AGGAAAGTTA TTAAATCTCT 
40201 TCTAAAATGC AT GC AT AGGC GGGCTCAGTA ACTAACATGC AAATGTTTAG 
40251 GGTCTGAAGC TCCTACCGAT AATCTTTCAG ATCTCAGAAT TCCAGCCCCT 
40301 TGTGCTGTTC TGGGTTGTCT GACACAGACG AAGCAGAGAA CAGTAGAATA 
4 0351 AACAGCTCAG TAAACAATTC ATTGAGGGAA AGAGAGTGAG AAGATTCACT 
40401 GGACAGCTAG AGGAGGAAAT ACTGCTGGTG ACTATGGAAG AAATTTGCCC 
40451 TAAGGCCTGC AGGCAATAGC TTGGTCTTAT TTATCCTGGT GTCCCACCCT 
40501 CTCCTCCAAC ACATACTGCC CTGGCAGGTA CGTAGAAGAT GCGTGAAAAT 
40551 ATCTTTTGAA TTGAGCTATG CAAAAAATAC TGGATTCTGC CCTCCAAGAG 
40601 TTTACTGTTT AGTTTCACAG AAAGCACATG CCCTCCTTTC TCTGCCTCTT 
40651 GAAGACTGAC CTATCTTTCA AGGCCACTGG CCCAATTCTG TTTTCTAAGT 
40701 AAGACCACTG AGTCAGTGGT GACCTCTCCT TCTCCCTAAC AAAGTCTGAT 
40751 TTACTTGAAT ATACAACTAT CTCCCTCTTG GCCTGTGAAT TTCTTGTGTT 
40801 AGGGAACATA TCTGATTTAT CCTTATCTCT TCCACAGTAC CTGGTGTAAA 
40851 ATGCCCAATA AATGCATTGA AATATTCATG AAGCTTACTA AATGCTCTGC 
40901 CTTATGAGCC ATGAAATATA AAGTGCCTTA AACTTTGTTT TTCTCTTATG 
40951 TAAAATAAGG ATAATAATAA TGACACCCCT ATAGGATTGC TGCAAGGATT 
41001 AAGTGTGATA ATATATATAA AACTCTTAGC ACAAACACCT GGCTCACAGG 
41051 AATAGTAGCT ACTACCATAA TGGTAACTTC GAGGGCAAGT TTTCTCAGAG 
41101 TTATTTAGCC CTCCTTCACC CTGTGTCCAG GAGTGCAGAT CAGAATGGTC 
41151 AGATTCCAGG ACACCAAGTT TTCTGTGGGA GCTTCCCTAG GAATATAACT 
41201 AAGGAATTTA AATCAGGTTC AGCTCATGCT GTTACACTCT CTTCCTCCAC 
41251 TCAGGCATTG GGTGTGGCTT TTCCAAGCTT GAGAAGGGTG TGATCTGAGA 
41301 TGGGCTTGGG TATAGAGGGG AATTATATTT AGGTCTACCC TGTATAGGAA 
41351 AAAGTGCCTT CCCAAAGTCT CCCTGGCCTA AAGTATAAGA GATATGTGTT 
41401 GGGATTTAGA CCCAGAGCCC AAGCCAATAA TGGGACCCCC TTCTCACATG 
41451 TGGCTACCTC CTGCTATCAC CACAACAGCT ATCATACCCA TAACTACAAC 
41501 AGAGGCCAAT TAACGTGGTG ATAATTGACA AATGTCAAGA CATCCTACAT 
41551 TGAGGCACAC TGTGCGTTTT GCGTGAGCTT TTAAATTGGT AGGGAAGGAA 
41601 AACTTTTATA CCTACACCTA TCATGGAAGG CAGAAGGTAA GAGCTAAAAT 
41651 AAAGGTATGC CAAGAACAAA GGCAGGAAAG AAGGGTTTTA ACAACTTGAG 
41701 GCCTGATCCA TTGATTAGTG AAGAGGAAAC ATGTTCAAAA ACCACTCTAT 
41751 AACCACCTTC TCCAAGTTTT TTATAATTTT GCTTCTTCGG ATATCTTCTC 
41801 ATCATAGTCT TAAATGCCAT CAAATTAACT GAAAAATGCT AAAAATGCAA 
41851 CCACTCTAAG AGAATGGGTT AGATGGGAGA TGGCTTTGTT AAAGAAGTCG 
41901 GTCTTAAAGC AAAAGTAGGG CTTTGTCATG GTAGTATGGA AGGAAGGACA 
41951 TTTTTGGTCA AGAGAAGAAA GTGCAGGGCC TGTTGAGGAA GGAATGAGTA 
42001 GTAAAATATG GCTAGAACAG GGTGCAGAGG GGAAGAACTT CAGAGAATGA 
42051 CCAAATAAAC AGGCTGAAAG GTGTAGACAT TATAGGCAAT AAAGCAACCA 
42101 CAGAGGTTTC TAAGCCATAG GGTGACATGA TAGATCTGTA TTCTAGAAAA 
42151 GTTAGTTTTG CAGCAGTTGT GTCCATTGAA AGGGACAGGA TAAGGGAGAT 
42201 AGATAAGAAG ACATGCTATG ATGATAACTA GATTTGGATA CCAAGTGGTA 
42251 TGGTGGAAAG GAATGAGAGA ACAGGGTCAC AGATGAATGA CTGCCCAATT 
42301 TCAATCCATC ATAACAGGAT GTATAGGATT GCCCTTAAGT AAGATGGGGA 
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4 2351 ATCCAAAAAC GAGGAACAAG TTTGTAAGGT TTTGGGGGCC AATGATGAAT 
4 24 01 TCCATTTGGG ACATGTTGCT TTGGATATAC CAATGGGACA TTCATGTGAA 
4 2451 AATGATCTCG GCAATCCTAT CCTGGAATTC AGGATAGGAT CAGAATGAGG 
4 2501 GACACAGTTT ATAAGGTAAA CAGAATGGAG GTGATATAGA AGATAAGGGC 
42551 ATAGATGAGC TTACCAAAGG GGAGAGTTTA GAATGAAAAG AAAAGACCAA 
42601 AGGCTAAGCC TGTGCTATTC TTTCTCCTCA CAATACGCTT CAGACCTGGG 
4 2651 CACAAACCAT CAGTGAGTGT CATGATAACA CTACTGTGGG CAAATCCCCC 
42701 CTCTATAAGG GCCTGATTTC CTCCTCTATA AAATAGAGGG TTGAACAGGG 
42751 TGGTCCATAT CCTGTTAATT GTGTTTGGAG AGCACACAAC AAACCAGCTA 
4 2801 CTATCCAAAG GGGACATCCC GAGGCAGGAC TAAGCAAAGG AAATCCAGCA 
42851 CAGGGAAAAC ACTTTCTGGT GCTGGTCCCA GTTAGGCAGC GTTCAGTTTA 
42901 ACCCATCACC ATCACCATCA GTAGCTTTCA GCTGCTACTG ACCACACTTA 
4 2951 TAGGAAGAAA AACAATTAGA ATGGAGAGCT AACTCTTTGG AAATGGTCAA 
4 3001 AGAACACGGG TCTACAAAAC CGTCAATAAA GCGCTAAGAT GCCTGGGCGG 
4 3051 GGTCAAAAAG TCTACCTGGG CGGGGTCAAA AAGTCTACCT GCTCAGCATA 
43101 TGGGGCCCAG ACATCTGACC TTTACCAACT CCACAATAAC CACTTCATCT 
4 3151 ATGGATCCAG TCTTGGTATC ACCTAGTCGC TGTTTTCAAG TAACAGAATA 
43201 TTTGGTTCTC AATGGTAGGT GACTGGAATA CAGCTTACTT TCTCCCACCC 
43251 CTACCCCC AA TCCTTTCTGC CCCCT TAT AG TTTAATTTGC TTGTAAATTA 
43301 CTTGGGAA7A CATTTGGGAG CCATTATAGG GAAATAGAAG GCAGACATGA 
43351 TGAACAGAAT GCAGGGTCTT TTTTATTACT TCACATTGTG CTCAACAATT 
43401 AGGAGGAATT CTAGAAGCCC CTCCCAGTGG CCAGGAATTG GTCATAGCAT 
43451 GAATAAACTC AATATAGGTT GAGTATTCCT TACCCAAAAT GCTTGAT ACC 
43501 AGAAGTGTTT TTGCA7T7TG GATTTTTTTT TTGAATATTT GC ATT AT AT A 
43551 CTTACCACTT CAGCATCCC7 AATCCAAAAC TGAAATCTAA ACTGCTCCAA 
43601 TGAACATTTC rTTTGAGTCT CATATTGGCA CTCAAAAGGT TCCAATTTTG 
43651 GAGCATTT7C AATTTTSGCT TTT ViOATTA GGGATACTCA ACCAGTGGTA 
43701 GGTTTGGGAT C*A T A TC" Af «C A TGCTAAGGTC AAAGAGACCT AGCTGGGAAG 
43751 GGTGGGAGGA A TATCV.AATT TT ZA TT-TTCT GGGCACCCCT TGAACAGTCT 
4 3801 TACT ATT AGG GTCCCAAA7T T-'iTTTTAAGT GTGTGTGTGT GTGTGTGTGT 
43851 GTGTGTGAGA GACACAGAC* r»A* .ATJ^^AGA GAATTTTCTT TCTTCCTTTA 
43901 TATTCTAAGT 7 r .(."T f ZA- *C*A~. AAA A TTTTGG GTTTCTTTGT ATTCTCCCTG 
43951 CAGCTCCTCA TCTA-'iTTTTA A AAATAAA GGAATTCATT AGGTC CTTG A 
4 4001 TTTCAGAAGC CTCCCACTTC TCTATuTAGG AGGAATCTTA GGGTGGCAAG 
44051 ATAAGTTGAG G~*A CTTT T C T T C AAGOA C AT TTCACAAGTA AGAGAAAATG 
44101 TTGACTGTGT ATATCTAAJA A7.*V;TGGGG CTCAATGATG CCCCCCTAAG 
4 4151 TTACTCTTTA CTA7TATTCA TTGATTGATT GATTGATTGA AGAAGCAATG 
44201 TTTTGATTGA TTGAAC.AAGT AATGTTTCCA ATGGCTACAG CAGACTGGAG 
4 4251 CAAAAGAACA AAATCAAAJ-A AAA? A ~ ATT A GGCTTTCCAT TTCTTCTAAT 
4 4301 TCTGGGGCAT CTGATGAAGC TTTGGATCCC CCAAGGTAAG AGCTGGACTC 
4 4351 TGCTGGTGAA AACTCTTTAG GAAAAACAAA AGAATATTGT CAGAATCTGA 
4 4 401 TGCACCTTAG AAATGATGCA GCAGAACTGC TTTATTTTCT AAAAGGTGAA 
4 4451 ATGGAGACCC AGAGAAGCAA AGTGATTTGT TCATGATCAT ACAGCTATTC 
44501 AGTAAAGCCA GGACTTCTGT GATCCACTGT CCTTTCCTTA AACCAGTGGT 
4 4551 TCTCAACCTT GGGAGCTTTA AAAAACTGCT AGTGTTGGAT CCATCTCAGA 
4 4 601 CTAATTAAAT CAGAACCCAT GGGGATGAGG CCCAGACATG AGTGGGTTTT 
4 4 651 TTGTTCTTTT TTAAAAAAAA GCTCCCTAGG AGATTTCTCA AAGAACTGAA 
4 4701 AATAGAACTA CCATATGATC CAGCAATCCC ACTTTTGGGT ATCTACCCAA 
4 4751 AGGAAGATAA ATTATTATAT AAAAAAGATA CCTGCACTCA AATATTTATT 
4 4801 GCAACACTAT CCACAGTAGC AAAAATATGG AATCAACCTA ACTGTCCATC 
44851 CATGGATGAC TGGATAAAGA AAATGTGTAT ATATACACAC ACAATGGAAT 
4 4 901 ACTATTCATT CGTAAAAAAG AACAAAGTCT GTCTTTTGCA GCAATATGGA 
4 4951 AGGAACTGGA AGCCATTCTC TTAAGTGAAG CAACTCAGAA ACAGAAAGGC 
45001 AAATTCCACA TGTTCTCACT TACAATTGGG AGCTAAATAA TGCATATGCA 
45051 TGGGCACAGA GTGTGGAATA ATAGACATTG GAGACTCGGA AGGGTGGGGG 
4 5101 GAATGGGAGA GGGTCAATGA TGAAAAATTA CTTAATGAGT ACAACGTACA 
45151 TTATTTGGGT GATGAATACA C T AAA AGCC C ACACTTTACC ACTATGCAAT 
45201 ATGGCCATGT AACAAAATTG CCCTTACACC CCTTAAATTT ATACAAATAA 
45251 AAATAAATAA ATAAAAGCTC CTTAGGGCTG AGAACTACTG CTCCTGTCCT 
45301 ATGGGTCCCC AGCTTTATTT TAACTCAAAA TGAGTTTAGA AAAATTTATG 
4 5351 AACCCATTTA AAAATATTTA TTGAGTATCT CCTGTGTGCA AGGCACTGTG 
45401 TTATGTTAAG TGGCTGAAGG GAAATTAGAC TGGGGAAAAA GACAAGGTCA 
45451 TGGCCTAGGT TTCAAACTAA TATAAAAGAC ATAACAAATA AGAAAGGATG 
45501 CCACCTTCTT CCAACCCTCA TCCCTCTTCC TTTTGACAGT TGCAGATGTT 
45551 GCTAATTCAT TTTGGCACCC TTTTTCTCTG ACCCAAATAT AGTCTTATAA 
4 5601 ACCTTTTCAA CCCACGGCTC TAGGCAAGTA TCACCTTTTG CTCTTTTGGC 
45651 ACCAGATCTC TTGAACACTA TTTACTGGTT TTGGAAAGAT TATACATGTA 
4 5701 TGTCTGGAGT TGAATGACTG AACAGAGCAA TAATAAGAGT TAAAGCAAGA 
45751 AAGACAGGCC TAG AGG AG AT GGCAGAGGGT CTTGCCTGTC AGGCATTGAT 
45801 TTTGAACTTC ATTGCATAGG CAATCAAGAA CTATTGAAGT TTTTGCACAA 
45851 AAGACTATAG ATGAGATTAA CCTGGTTACC GT AAAGG AC A AAGTGATTGC 
45901 AGGTAGAATG AGGCCAGCTT CATAAATGAA TCATCAGGAT AT G AG AAGC A 
45951 AGGGCTTGAA CATGAGAGGC CATAGTGGGA ATGGAGGGAA AGGGACAATG 
4 6001 TG AG AAGC AG TGAAGGAGAA GGGCTGATTG AGTAAAGCAG TGGAGAAGAC 
4 6051 AGTGAAAGAT GTCAGATGAC TACCATGTTT GGCGACTGAG TGAGGGAAGA 
4 6101 GGTGGTGATG ATATTACTGA AGAGAGAGGC AAGGGGTGGT CACTGGATTT 
4 6151 AGAGCAGACA TTATCAACTT GTGGTGTCCA GACATTTCAC CCTGGGAGAA 
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4 6201 ACCTGTTCTG AAGTGGCTTC AGCATCTCTG AGGTCAGATT CCTAGTTCTA 
4 6251 CTATTTTTCT ACTGACTGAA ATGGAAATCG AGTAGGCAAG GCTTTTGATT 
4 6301 TGTCTCAGTG GTCTCTTCTG TAAAATGGGG GTGTTTATAT CCATAGTCTT 
4 6351 ATCACAGGGC TATTTGGGGG ATTAAGTAAG ACAAGTGTGG CAGAGCTTTG 
4 6401 TAAACTGTAA TACACTGTGT ACAATTGGAT AATTATGGAT TCTTCTGACT 
4 6451 CATCCACATG GATGTCTGCT GACCCTGGGG GACCGGAGCC TGGGAGGGAG 
4 6501 GCCAGACCTG GAAATGGAAA CTTGAAAATG TTCTCTGTAG AAAAGATAAT 
4 6551 TAACATTTGA GGATGGTTAA GTCCTCTTAA ATAGATGTCA GAAAAAATGG 
4 6601 AGGTCATGTA GACAGAATGT TGGATAACAC TACTTTGTAA AATATTTTAT 
4 6651 CTTATTTCCA TTATAAAAGA AAAAAAGCTG GGCTGGGCAC GGTGGCTCAC 
4 6701 GCCTGTAATC CCAGCACTTT GGGAGACTGA GGCGGGTGGA TTACCTGAGG 
4 6751 TCGGGAGTTC AAGACCAGCC TGGCCAACGT GGTGAAACCC TGTCTCTACT 
4 6801 GAAAATAGAA AAATTAGCCG GGTGTGGTGA CAGGTGCCTG TAATCCTAGC 
4 6851 TACTCGGGAG GCTGAGGCCG GAGAATTGCT TGAACCCAGG AGGTGGAGGT 
4 6901 TGCAGTGAGC CAAGATTGCA CCATTGCACT CCAGCCTGGG CGACAAGAGT 
4 6951 GAAACTCCAT CTCAGAAAAA AAAAAAAAAT AGACAGGAAA ATAAAAAAAG 
4 7001 CCACCTCACA TAGTCTACTA CCACCAAACA CATCATTAAC ATTATATTTC 
4 7051 TTTATTCCAT GCTCTTTGTT TTTAATATAA ACAATTACTT TTAAGGGAAA 
4 7101 ATGAGAAAAG GAGAGAGTGA TAAGACTTTA TTTTAAAAGG TGGAATAATT 
4 7151 C7AACCATGG AGAGTATTTA TAAATTTTTT TTTTTTGAGA CAGAGTCTCG 
4 7201 CTCTGTCACC CAGGGTGGAG TGCAATGGCG TGATCTCAGC TCACTGCAAC 
4 7251 CTCCACCTCC CGGGTTCAAG CAATTCTCCT GCCTCAGCCT CCTGAGTAGC 
4 7 301 TGGGAT7ACA GGCAACCGCC ACCATGCCCT GCCAATTTTT TTTTTTTTTT 
4 7 351 TTTTTTTGGA GATGGAGTCT TGCTGTGTCG CCCCAGGCTG GAGTGCAGTG 
4 7401 GCATGATCTT GGCTCACTGC AAGCTCCGCC TCCTGGGTTC ACGCCATTCT 
4 74 51 CCTGCCTCAG CCTCCCAAGT AGCTGGGACT ACAGGCGCCC GCCACCGCAC 
4 7501 CCAGCTAATT TTTGTATTTT TAGTAGAGAC AGGGTTTCAT TATGTTGGCC 
4 7551 AGGCTGGTCT TCAACTCCTG GCCTCAAGCA ATCCTCCTGC CTCAGCCTCC 
47 601 CAAAGTGCTG GAATTACAGG TGTGAGCCAC CGTGCCAGGC CCATAAAATA 
4 7 651 TTTTTTATAG ACAAGTGAGA GCAGAAATCA CAGGTTCTTA TGAGCAGGAA 
4 7701 AATTTTGAAG GTCATCTACT CTGAACGTTT TTTTGTTTGT TTGTTTGTTG 
47751 TTGTTGTTTG TTTGTTTTTG CTTAGTTTAC ATTTATTAAA TACCCGTTAT 
4 7801 GGTCCAGGCC CTTGGCTAAG CGCCATCCAT GCAATATATC ACAAGATATG 
4 7851 CCCAGCAATC CTAGGAGGTA GGGTTTATTA CTACCCATCG TACAGAGGAG 
47901 GAAACTGAGT CATAGAGTTT TAGTGTCCTG ATCCTGGTCA CAGAGCCAGG 
4 7951 AAGTGGCAGA GCAGGCCAGG CCAAGTCTGT CTGACATCAG AGCTCATCAG 
4 8001 AGCCCTCGCC ATTGTCCTTG AACCAGTAAA GATGGAGTTC TTCTACAGGG 
48051 GTGGTTGGGG GACAAGGACC CCATGGGTGT GTCTGAGTCA GAAACATCTG 
48101 CGAGTGGGCT GAGAAATGAG TCTTCTGTGA AAAAGAGCAA AAGAAAAAAT 
48151 GGGTCAGGAG CCAATAATCA TTGTCCATCT TTGTGTGAAT GTATGGTGTG 
4 8201 GGAGTGGGAG CAATAAACGA TTCTAAGGTC AC AC AG AAAA GATGCCACCT 
48251 TCTCCAATCA CATACCGCCC CTCGTCCCCC AGTTTTCTCT GAAATAGCTC 
4 8301 TTCTTTTGGC TCTATCCTGG CTTCTTCACA CAGGGGTGTC CAGTCATCTC 
4 8351 ATCCTGGTGG GACAGGGATA GAGCTGTGGC AGTGGAGATG AGGAAGCTCG 
4 8401 CCTCCTAAGT GAGTCTGAAT TCTTAAATAT GGAGCCACTC CATAATCATT 
4 8451 TGGAGTGAAT ATTGGGCCAT GGCCCTTTTT CTTGCCAGCT GAGCTATGAA 
48501 AAAAGGATGT CCTAAGACCA GAGGCTGTGG GACCATTCCC AGCCCCTGCA 
4 8551 GGAATCAAAG GAGCTGACAG AATTGTTTGT TTGTTTTTTT CACAAATTGA 
48601 AAAAAAAAAT GTAAAATTTT TGAAAAGAAA GCCTCATTGA AAAGAAATCC 
48651 CTCTCCCCAG CTGGGCTCCC AGGCAGCCTC CTGCAGAACA TCCTTAGCAT 
48701 TGCAGAGTTG TTCCCATGGC AACCGAGTAA GGGGCTTTTT GTTTTCCTTA 
4 8751 GAAGATTGAA TCCTTTCAAC CAGAAGGTAA CCACTGGTTC TTCCCCACAA 
48801 TCCACACTCC AAACCCCCTA CCCTTATTTG ACTACATGAC TAGTTTTGCA 
4 8851 TTTATGGATT TTTTTATGCC TAATTGAAAA AGGCTAAATA TACAGAAACT 
48901 GAGGCTGAAG TGGTTTAAGG AGGCAACTGG CCCAGTGGTT TCTCAGCAAC 
48951 CACATGTCAA AGCTGTGGAC GTTAGACTTG ACGAGAGCAA GACATATCAG 
49001 AATCTGTAGC AGGAGCATCT AGTCTCCCAG TTCAATAGTG TCCACAAAAG 
4 9051 AAATCCAGAG GTTTTTGAAG CAAGGAATTT GGGTGGCACT GCTGTGAGAA 
4 9101 ACAATCACCT GGCTCCTCCA TGGGGCATAG AGTGGAGATG CTTCTTCAAA 
4 9151 TACCCCTTCC TTTCCAAGGC CATGACTCAG AATGACTGGC GTAGGGAGCC 
4 9201 TGGACCTGAT CTCTTCAAGG AAGGGGAATC AGATGAGCTG TTTAATCTCT 
4 9251 CTTGTAAAAT GAGGGGTTAT GAGACCATAG GCTCATTTTG GGGGGGGTCT 
4 9301 AAAATGCAGT ATTTTTTGAA CTGATATGGG GAAAAAAAGA CATTTCTGAA 
4 9351 TTGTTGTCAT GTTGCAGATT CTGGGCCGTT CCAGCATAAG CACCTTTCTT 
49401 AGAGTACTTG GCTTTGTGAA GTAGTCCTTA TCCCCTCCTT CCACTATTTT 
4 9451 ACATCAAGTT AAAATAGAGG AAGATGCCTA GAAATGGCCG TATAGACAGA 
4 9501 GAAAACTGCA CTAAAACTCC CTCCGTCATG CCTGACTCCT CTCTAGACTA 
4 9551 TGACCATCGA GGGGCCAGAA ATCATATCTT AAAGATCACT GTGCCTCCAG 
4 9601 TACCCAGCAC GGTGTTTAAT AAATGTTTGT TGAATGAACG AACTAGTAAA 
\ 4 9651 ATTTTCAAAT CATTAGAGCT GAAGTATCCT TTAAGATTCT TTAGTCCCTC 
4 9701 ATTTTACAGA TAAGGAAGCT AAGGCTCAAG ACATTGTGTG GCTTGGCCAA 
4 9751 AGGCACACAG CAAGCTAAAG GCAGAGGGAG GACAGGACCC GGCTGTCTCA 
4 9801 ACCCCCTGGC TGCTACACTT CCTGCAGCAT TTCTAATTCT TTTACCATTC 
4 9851 TTGCGAGGGA TTTTACAGGC ATGTACTGCT AGAGCCGAAA TAATTAGAAG 
4 9901 CCTCTTACTA CTCATCAGAA AAGCTATGTG AGCCCCTAGG GAGGACACAG 
4 9951 CTAGCCTAGA CTCTGCCTCT TTGCCCTCTG CTGCTTATTA GCAGAATGTA 
50001 AGTGGTTGTG TATGATGATT AGTGTAAGTA GGATGGGCAA ATGCACACCT 
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50051 TTCCCACCTT CAAACTCAGA AGTTGTAACC AAGAGTCACA CTGACTAAAC 
50101 ACTCCAATTT CCCTTTCTGT TTTTCTTAAC ATATGTCCTA TTTTACCAAT 
50151 AATAGCCATG GTATATTAGT CATGGTATTT CACGCTAGCT GCAGAAATAA 
50201 CTTCCAAATC TCATTGGCTT ACTCAGTGAA AGTTTATTTC TTACTCATAT 
50251 AAAGTTGAAT GTCCTGGTCA GGCAGTTATC TAAGCCACAA CTTGGGGATG 
50301 GGGATGCAGG CAGCTTCCAT CGTATTGGCT CCACCATTCA GGGATGGCAG 
50351 AGTTGCTCTG GCATAATCCA ACCAATAGAG GGGGGAGGTT TGGCACTTGT 
50401 CAGTTAACCA C C T AGCCT AG CATTGACACA CACCACTTCT ACATACACTC 
504 51 CCCTAGTCAT CATTCAGTCA TGTGGCCCAA CCTAGATGCA AAGGCATCTG 
50501 GGAAATGTAG CCCCTATCTG GTCAGCAACA ACTTTGCACT TGGAAGGGGA 
50551 GCCTGAATCG TTATTGGTCT CCAACACATG TAACTAGCAA TTATACAGAA 
50601 CGTTATTTGT CAGGCAATGT GCCAAGAATT ATTTCATTTA ATCTTCACAA 
50651 CAATCCTATG AGGTTATTGT CCTCTTTAAC GTATAGATGA AAAAGTTGAT 
50701 GGTAGAGATA TAACTTAACT AATGCAAAGT TGCATAAGTG GTTGGTAGCA 
50751 AATCCAAAAT TCAGGCTGTT CTCTCCAGAG CTCAGGCTCA TGATTGCTGC 
50801 ATTCTACTGC TTTGAGCTTC TGATCTGAGA AAATGCATCA GCCACTAAGT 
50851 AGCCTGTGTA GTCTCCAGCA ATTACTTTCC TCCCTCTGGA TCTTGGTTTC 
50901 ATTCTCTGCA AAGTGAGGAT GTTTAACTGG AT AAAATC TG ATGTCACCTG 
50951 CCAGCTGGGA CATCATATGA TTCTCAGGGT AAGCATATCA GGTGGGTGGG 
51001 GTCCCCAGTG ATGCTTGACC ATAGCAAAGC CCTTTCAAAG GTTTCTTAGC 
51051 ACACCACATA AATGGAAGCC TCACAGTGTC CATGTAGGAG AAAGCAGGGC 
51101 AAAGTATTTC CATTTACCCA ACAAAGAAAT CAACATATAG TAAAAAGAGA 
51151 GTGTTTTCCC ACCAAGGCCT CAGATTGACT AGCGGTAGCC TTGGAAATAG 
51201 GACTTTATTT TGTATAGTAC TTTTGCCACC AGGGTGGGGG GGAAAAGAGT 
51251 GCTTCTTTGC CCCAAATGCT GGTTTCATAA AACCTAAAGA TGTCACATGG 
51301 AAACACACCA TTCCCCCAAT CCCCCTCAAA AAACTACTTG CACTTAAATG 
51351 AAAGAGTAAA GCTGTAGGAC TTTACTGAGC AGTGTCCTGT GGGGTCCTTG 
51401 CACTGCCATG CTCTTGAGGG GCTCGAGGTG TATGAATTCC CCAGCATTAC 
514 51 TTCTCCTTAG AGGTTTCAGA TGAGCAGTAT GAGCTCCAAA CTCATGCTAG 
51501 ACCCAAGTAT TTCATGAAAG AACAATCCTT GAATGACTTT ATACAGCAAA 
51551 GCTATATTTC ACTGTGTCCT AGAAAACCAA TTGTGTGTGT TTGTGTGTGT 
51601 GTGTACAACT GCTTGTGTTC TTTCTACCTA TGTCCCCCTG ATGCCTCCAC 
51651 ACAGAACATC CCAAACTCCA TTTCAGGTTC CTCTTGAGAT TCCCAAACTT 
51701 GGAAACAGGA GATGCTTCAA AGGCCTCTTG GAATGTCTTT TGAGGCTTTA 
517 51 TATTGTGATA TGTTGGACAG ATGGTTAAGA AACAGAAGAA GAGCATCACC 
51801 AAAAGGATTT CTCATTTTAT GTGGAGATCT ATTAATATTT GCCACTAGCA 
51851 AAGGCATTCT TTCTTGGGAA TGAATTATGC CCCTAGAATC AGATTGACCC 
51901 CACAGAAACA AGGGAGAATA AATAGAGACT TGAGCTTAGA CCTTACAACA 
51951 TGGCCAGAGC TGAAAAGGCT GAGCTCTAGG CAGAGAAGAT GCAAGAGCAG 
52001 CTTCAGAAGA CCTGAGAGCT TATTTGGGTA GGTTCCTCTG GTGTAAAGGG 
52051 TTCTTGTTCA CGTTTTCTTC CAGAATAAGA AAAGAACGCA AGGTGTCAGA 
52101 GGGTGGATGG AAACAGGGTA TAAAGCAGGA GCATTTGGAA TCTGCCCTTT 
52151 GTAGCCTGGC CCAGAGAGCG TCAGGCAGCT TGTTGGGTAA TAAGTAACAC 
52201 TGGCATTTTT CCCATGGTTC TGTCATCTTA AAGAGCAGGA TACATAAAGG 
52251 GATTCAGATG TCTTGTTGGT TTGGAGAAGC TTCTTTTTAA TACCTTGTTT 
52301 TAAAATTTAC CTGGAATTTA TTTTAATCAG GTGTGGTAAG ATGCACAGAC 
52351 ATGGAGATGA CAGTCATGAA GGAAGAAGTA TTTATACTCA CAGATCCCTG 
52401 TAAATAGGAA GCATGGCCTC CATGCAGGCC AATGGGGAAG CACCAGGGTC 
52451 AGCCGCAAGG CAGAAGGAGC AAGAGGAAAA CATGGACAAG AGGCTCTACT 
52501 GTGGATTCAG TGGCAAAGAA TGGGAGGGGC AGAGTAAGCA GGTTTAGGAT 
52551 TATCGGGTTT GAATGACTTG ATTGAGCTGT AGGGTGTAGA GACTGCCTCT 
52601 ACTGTCTGGC ACCAGGGGTA ATTAGGGCAG CTGGATAGTG GTCTGGAGTG 
52651 TGAGAGCTCC CTAAAGGAGG TGGTTGGAGG TGTAGGTTTT GGATTGGTTG 
52701 ATCTGTATAT GAAAGGTGCA CGTGCAGGTT GAGTCCTCTA CTATCACTAG 
52751 AAATTGGCTG GTCCCAGGAG AAGTAGTCTC TCTAGAGACA GCAATGCCCC 
52801 AGATGTCAAA GCATCAGAAA ATACAGAAAA AAAATTAAAA GCATGATTAA 
52851 TTCATACTCA CAGGTCTAGT TTTTGTGTAG TTAAGAGCAA CCTAAAGAAG 
52901 TTGATAACTC GTGTTGCAGG TCAGGTTTCC CAGAAATCAT ATTCTCAGAT 
52951 GAAGATTTGC ATGAAGGAGG TTTAATGCTC AAACTAAGCC CTAAGGCTCC 
53001 ATACCTGTGG AGGAAGTGAA AGAAGCCCAA CTGGGCACAG AAGGTGGAAC 
53051 ACAATGCCAG TCACACAAAG ACCTCAGTGG ATCCTGGGCC ATGAGGAGCT 
53101 CTAAGCACAG ATGACCCTTC AGAAATGTCT CCAAGTGGGG AAAGGAATCA 
53151 TGCTAGTCAC TGGATGTGGG CTTCCCACTC CACCCCATGA GGGCATGACC 
53201 TTAAGTGAGA GAGCTCTTTG GACACAGGGC ATCCTAAGAG GGGCACTCAG 
53251 CAGCCACATT GGGCACCAAG ACTCTCAGCA GCTAGAAGAA GAAGGTATAG 
53301 TCCCAAAGGG GAATCTGGGC TGCACACCTT AGTATCCATT AGAACTGGAA 
53351 GTAGGCTGAA TCCCAGGCAG GGATCCCCTG GAGAACACAG GTAATTTTTT 
53401 AAAAAATCAA GCTATGTGTC TGAGGCTATG TGGTAAGACA TCTCAGTTTT 
534 51 CTGCTAGGAA AAGCCACCAA ACCAGATTGG CTTATTCATG TTGAAAAGTC 
53501 TGAGAATCAC ACTCAGATGT TGTTGATAAT TCTGCTTGGA T AAAATTT AT 
53551 CTATTGGTAT GCTTGTGATA TAGCAGTACC ATTGCTAAAA ATTCCATGCG 
53601 GAGAATCCAA TCTGCATCAT TTTCTTTCTC AATGATTTGT TTTTAAAGGC 
53651 AGAGGTTCGG CTGTGCCCCT TTAAACCTTC TGTGCAAGTG CCAGCTTCCT 
53701 TTCAAATGGA GAAGCAGCAG CCCTGTCAGA AAGGGTGGCT GGAGCTCCCC 
53751 TTTTGTGAGA GGAGGAAAAC TTACTGGGAA TTACCTGTTC GAGAGCCACA 
53801 CATGAAGGCA TACCACTGCT TCCTCTGACC TTCCAGCCGG TATATTAATG 
53851 ACATACTGTT GTACCTGAGA ACCAATGATG AAGTGGGTGA TGTGCCTGGC 
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53901 ACCTTAAAGG CCTGGGCCTG CTTTGACAGG GGAGATGATA CACAACATGG 
53951 CTGTTAGCCA GCTCTCACTG CATCTGGAAG CACCATGTTC CTTAGAGCCA 
54001 AAGTTCTCAA ACTGTGCTTC CTGCTGGGCT CCACAGATCC TTCCCGTTCC 
54051 ACCCTGCACA CAAACGTGCA CACACATACA CACACACACA CACACACACA 
54101 CACACACAGT GTTCTCAATG CTCGCCATTT AGTTAGTATG CACCAAATAT 
54151 GTGTAGTATC TGGTTCCACC CCTGGCCTCT CAGACAATTA TTAGTATTTT 
54201 TGGGAGCGGG GAGGAGAGTC AGGAAGACCC AAGCGCCATA TTTATTATTT 
54251 CCCCAGCCAC CCCGGCCCAG GCTACATCCA AGTTCAAAGT CTATGACCCC 
54301 CTCTCTGAGC TTTCAGCACT ACCTCCCTTT GTGGGGGAGG GGGGTGCCAA 
54351 TTCTCTTTCT TCTCATCATC TCCTGTTGCA AAATAAAAGC CTAGGCATTC 
54401 CTTTGAGAAA CTTGGGCCTG GCATTGGAAG . GCGTCTGACA AAGGCTTTGT 
54 4 51 TAAATGAGTG GAGGGAGGGA CGGTCTGGGA GATACTTTTT CAGGTGGCAT 
54 501 AGGACCTCCG CTTCTTCCCT TCTCACATGA GAAGGAAGAT TTTTCTAGAA 
54551 ATCTACAGGT GTTTAAGCTG GAATGTGCCT CAGACATCAT CTGGTTGGAC 
54 601 CCTTTCATTT TGCAGATCTG AGGCCTAGAA AGATTTGGTA ACTTGCCCCA 
54 651 GGTCACAGTT GACAGAATTG CTCAGTGAAA AGTCCAGCAT AAATACCCCA 
54701 GCCCATGTGG CCACTGGCTG TGTGCTCAGC TAGTGAGGCA CACTTACTTC 
54751 TTAATTTGTG CCACCCACTT TTCAGGCTCC CTTAGGACAG CCTCCACCTG 
54801 CTCCTACTGT GCTTCCCATC GTCCCTCTCC TCAGGCACAG GCTGAGGAGT 
54851 AATAAGAGCA CCTGATATGT GTCAGGCCTT ACTGTGTGCT AGGAATTGTG 
54 901 CTAAGTACTT CCTATGAATT TTCCATTTAT TCTTTATAAT AACTTTGTAA 
54 951 AGTTAGAGCC ATTATTCCAG AAGGGAAAAC CGAGGCAATG GGAGTCAAAG 
55001 CAAAGAATTT GGGCTTTTAA CCATTACACT ATTTTGCACA AGTAGCCAGT 
55051 AATGAAAAGG CTGCTATCCG GAATCATCTT TGCAAAAGGT AATTTCTTTA 
55101 GCACTTTATC AGAAGAAGGG GGCTCCTTCC TCAAATTCTG AGGGAAGAGA 
5 5151 AGTGGGGAAG AAAAGATGAC TGAATCCAAA GCTCGGGCAG GGAAAGCACA 
55201 TCGAGTGCCA AGTGCGCTGC GCTGGGGTCT AGTCCTGACT ' CAGCCGCCAT 
55251 CTTCCCAAGT GCTTCCTGGA ATTCTCTCCT CTCGTGGGGC CTCAGCTCCT 
55301 TCATCTTAGG AAAGAAGGGT AAAGATCTAC AGACAAATTG ATCTTTAAGT 
55351 ATCCTTAGAG CACTACCATT TTCAGAATCT AGGATTCTAT ATCCTTCCAA 
55401 TTATCTCTGT GTAGGGAATT ATTGGTCGTG TCTCCTGATT AGGGAGCCGG 
554 51 ACACTCGTCT GTCAGCCCCA CCTGGCTCTG CAAAGTCCCT TGTGTATCTG 
55501 CCCTGCCTGG TCACGGGAGA GGAAGAGACA AGGAAACACC ACCGCTCCGA 
55551 CTCTGTGGAG CACGCGCTCT CTCCCACCCA CACACCCGCT CAGGAGAGGA 
55601 GGAACCTGCA CATTTGAGTC TCCTCAGAGC CTCTGCAGAC TCCCAGCAGG 
55651 GGTCTGGCTT TCCTCTCAGG TAGCACAGTC ATGCTGTAAA CTCATTTGGG 
55701 TCTTGCTTGG TATGATAATG CGTTTAGTTG AAGGGTTATA TAATTGCAGA 
55751 GTCGATGATG ATCTCTAGGC CAATTTAAAG TCAAAGCTAT TTTTAATGGA 
55801 ATTGCCAGAG GAGGGCAGGG ATGGGGGCAG GGAGGAGAGA TGGTTAGAGA 
55851 GTGCTTTTGA AACCAACCTC CAACAATTTC AGCCATTGCA TTTCCGAACC 
55901 TGAATTTTCA GGGCAGAAAT TGGACAATGC CAATTAAATC AGAGCAGGTG 
55951 TATGTGAGAG CTGGGTTCAC CTTCTTGCAG CTACAGTTTT ATTTTGAATA 
56001 CTGTTGCAGG TAGTGAAAAT ATGACTAGGC TGAATAAGAG ATCTCAGTCT 
56051 ATTCCCAGCT CAGCCAAAAG CCCTTAGTGT GTCCTTGATC AAGTTACTTC 
56101 CCCTATCCAT TTCCTTACCT GCAAATGAGA AGCTTGAACC AAACTATCCT 
56151 AATGTCCCTT TCAACTCTAA AATCCTAGAT GATCCTCAGA TGTCAACAGT 
56201 GCTGAAGCCC AGCACTGTAA GATGTCAGGT GGTCCGCAGA GGGTGAGGCT 
56251 CTTCCTGCTC AAATTATTTC TTCCACCCAA GACTCCTCAG TTACCTCTGT 
56301 ACACAACCTT GCAGGCCCAT CTAAGTATCC AATAACCTGG GGCTTTAGTT 
56351 TACAAATTTT CTTGGGGAAG AAGGT AAAAG GGATCTAGCT TTCTGGGTTA 
56401 TGAATGCCAT GTAGGGAGGG CATGGTTTGA GTTAGTCCTG GTGCTGGG AG 
56451 TTCATGAGAC TTATTCTCAA ATCTTCAGAG AAGAAAATTC CGTGAACACC 
56501 TGGGAACATC AGGAAAAAAA AAATGTCCCC .TAGGCTACTG TCAGGTTAGG 
56551 CTGCTGGTTC TGATTTGACC TTGAACTTGC TATAATTGAA CAAGATAAGC 
56601 ATGTGACCTA ATGAAATACT TTAAAACTTG TAGCTTCCTT CAGCACAGAA 
56651 GTGGCTCTCT GAACCAATTT TAAGCAATCC TGGCTCTATC TGTGCATGTT 
56701 GATTTAGCCT GTGGTTATAG TGTTAACAAT TTAGTGATTC ACCTCATTTT 
56751 TAATCTCTCT TTCCCTTTAG CAGGATCATT TTCTCTGTGT TAAGGGATCA 
56801 ACATTGAGGT AAGAATGGCT AAATAATAGC ATCTTCTGGA ATACAAATGA 
56851 CTTTATAAAT AAAAG AAGAT AAAAGGAAGA AGTAGGATGA TTTCTCAGCT 
56901 CTAATACACT TAGCAAATGC CATATGCTTT CTCCTGCGTG TACTGGTCAG 
56951 GCCAGTTCTA GATACAATCA TGCGCTGCAT AATGATGTTT TGGTCAACAG 
57001 TGGATTGCAT ATGTGACGGT AGTCCTTTAA GATTATAATA CCATATTTTT 
57051 GCTGTGCCTT TTCTAGGTCT AGATATGTTT AGATACACAC ATACTTACCA 
57101 TTGTGTTCCA ATTGCCTACA GTTTCCAGTA CAGTAACCTG TTGTACAGGT 
57151 TTGTAACCTA GGAGCAATAG GCTATACCAT ACAGCCTAGG TGTGTAGTAG 
57201 GCTATACCAC TTAGGTCTGG GTAAGTACAC TCTATGATGT TTTCACAGTG 
57251 ATGAAACTTC CTAATGACAA ATTTCTCAGA ATGTATCCCA GTTGTTAAGT 
57301 GAGGCATGAC AGTACTATAT CTCAAGACTG TCCCCAAGCT GAAGTCTCCA 
57351 GTGGACACAA AGACCAATGT ATTTAGTTGA ATCGTGGACC CCAAAAGTTC 
57 401 AAGTCCACCC AGAACCTCAG AATACAAGTT CAAGTCCACC CAGAACCTCA 
57451 GAATACAATT TTATTTAGAA ATAGGGTCTT TGCAAATGTA GTAAGTTAAG 
57501 ATGAGGTCAT ACCAGAGTAA AGTGGGCCCT AAATCCAATA TGACTAGCAT 
57551 CCTTGTAAGA AAA GG AAAAG GAACACAGAC AGGGGAGAAG GCCATGTGAG 
57601 AACAGAGACA AAGACTGGAG TGAGGCATCT ACAAGACAGG GAACACCAAG 
57651 GATTGCCAGG AGCCACCAGA AGCTAGGAAG AAGCAAGGAA GCATCCTCTT 
57701 CTGGGGCCTT CAGAGACAGG ATGGCCCTGC TGACACCATT GTTTCAAATG 
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577 51 TTTAGCCTTC AGAACTGTGA GACAATAAAT GTATATTGTT TCAAACCATC 
57801 CAGTTGGTGG TACTTTGTTA TAGGAAACTA ATACATTCAG GATGGAGAGG 
57851 TGTCTGGGAA GCCCATGAGA ACAAATGGAA AGAGCCAGAA GCCCTCAACC 
57901 TTGGCTCGTC TACAGCCCAT TTTCTTCATT CCCGCATCCA GGCTTTGAGA 
57951 TGACAGGAAG CTGTGAAACC TGTGAATTGT CTCCACCGCA AATCCTGCTC 
58001 CCTGGTCCCA CCTAGACTGT CAGGGTTGTG TGGCAAGGCT TTCATGCCTC 
58051 TCACTGACTG CCTAGTACGT CCCCTCAATG ACTGGTCCAC ATCTTTCTCA 
58101 CCTTTCTCAT GCATGGCCCC AGATCCACCC CAGTGCCTCG TCCTCAAGAG 
58151 GTGATTTATT CCGAGACACT GATGAGAGCA CTGTCCTTCC TGTGTCTGAG 
58201 GGAAGGCATG TAACTCTTGC TTATCTTCAC CTGTGCTCTA GATCCTGACC 
58251 TTCTCTGGCA ACCTCAGGGA CCTTGCACCA TCCATTCTTC TCGCCTAATG 
58301 GCGAGACTCA GTCTCTCCCT CTCCCTTTCC ACTCTCCCTT GCCATTCTTA 
58351 GTATCTTTCT ACAAGCAGGT CTTCCAAAGT ACTGCTTGAG GTCTGAGTTG 
534 01 GAGGGAACAT GCCTCTACCC TACTAAAAAG AGAAATTCCT CTGCAGAAGA 
584 51 CCCAAGCTGA CTGACAAATC CCTTTACTGC AACTGCAGCT CTAGCTCCCA 
53 501 CCATTTTCCT GTACTTACTC TCCTGCTCAG GTTCCCTGGC ATTGCTGATG 
58551 TCTTTCAGCC TTTGTGCCCT GGCCCCTTTC CTCCTCTCCC CTCATCTAGC 
58601 ACTACCTGTC AAAATCAGGG ACT T ACTT T A AAATTTATCC CAAATTATCA 
58 651 TTGCCATCAT CTCCACTGTC ACCTTATCAT ATGTTTGAAT AGCGTTTCCA 
56701 TTTCCCAAAT GTTTTCGCAT GCACTTTCTC AATTGAGCCT TACGAATCCT 
587 51 AGAGCTGAGA AGGGTAACAA TTTATGAGTC CTTTGACAAA TGTGGAAACT 
58801 CACATCACAG AAAGTAAGTT GCCAGCCGAT ATGTCACTGT CTTCAAACTC 
58851 TTCTTTGTAT TTTTATTATC TCCCATTATA TTCTGCCTCT TGTAATGATT 
58901 ATTTCTACAT TGGTCATATC TTTCCTTCTG TACTGATCTT CGCTTATGAT 
58951 AACAAATAAT AATAGTTTAC CTTTGCATCA CACTTGATGG TTTACAAAAT 
59001 GCTTCAAATT CAACATGGCC CTTGATCCTG A AG AT AT TT A TCACTTAAGA 
59051 ATCATTATCG CCATTTTAAA ATACAAATTT ATTACTTGGG CTAAATTTTC 
59101 TTATTATAGT TGGGATAGGC CTTCATCCAT AGGGTGAGTG CAGTATTTGT 
59151 GGACTGTCAT GGCAGCTTAA ACATTTAGTA CTTGAAAATC TGATGCATTG 
59201 ATCATCAGAG AAATGCAAAT CAAAACTACA ATGAGATATT ATTTCACCCC 
59251 AGTTAAAATG GCTTTTAGCC AAAAGACAGG CAATAATGAA TGCTGACGAG 
59301 GGTGTGAAGA AAACGGAGCT TTCATACACT GTTGGTGAGG ATGTAAATTA 
59351 GTACAACCAC CAGGGAAAAC AGTTTGGAGG TTCCTCAAAA AACTAAAAAT 
59401 TGAGCTACCG TGTGATCCAC CAATCCCACT GCTGGGTATG TACCCAAAAG 
59451 AGAGGAAATC AGTATATGAA AGAGGTATCT GCAGCCGGGC GCGGTGGCTC 
59501 ACGCCTGTAA TCCCAGCACT TTGGGAGGCC GAGGCAGGCA GATCATGAGG 
59551 TCAGGAGATC GAGACCATCT TGGCTAACAC GGTAAAACCC CGTCTGTACT 
59601 AAAAATACAA AAAATTAGCC AGGCGCGGTG GCGGGCACCT GTATTTCCAG 
59651 CTACTCGGAA GGCTGAGGCA GGAGAATGGC ATGAACCTGG GAGGCGTAAC 
59701 TTTCAGTGAG CCGAGATAGC ACCACTGCAG TCTGGCCTGG GCGAAAGAGC 
59751 GAGACTCTGT CTCAAAAAAA AAAAAAAAAA AAAGAAAGAG GTATCTGCAC 
59801 TCTCATGTTT GCAGCAGCAC TGTTCACAAT AGCTAAGATT TGGAAGCAAC 
59851 CTAAGTGCCC ATCAACAGAT GAATGGATAA AGAAAATGTG GTACATATAT 
59901 ACAATGGAGT ACTATTCAAT AAAAAAAAAG AATGAGATCC AGTCATTAGC 
59951 AACAACATGG ATGGAACTGG AG AT C AT T GT GTTAAGTGAA ATAAGCCAGG 
60001 CACAGAAAGA AAAACATCTT ATGTTCTTAC TTATTTGTGG G AT CT AAAAA 
60051 GCAAAACAGT TGAACCTATG GACATAGAGA GTAGAAGGAT GGTTACCAGA 
60101 GGCTGGGAAG GGTGGTGGGG GGCTTAGGGG GAGGGTGGGA TGGTTAACTG 
60151 GTACAAAAAC AGAAAGAATG AATAAGGCCT ACTATTTGAT AGCACATCAG 
60201 GGTGACTATA GTAAATAATA ACGTAGCTGT ACATTTTTAA AAAACTTGAG 
60251 TATAACTAAA TTGTTTGCAA CTCAATGGAC AAATGCTTGA GGGGATGAAT 
60301 ATGCCATTAT TCATGATGTG CTTATTTCAC ATTGCATGCC TCTGTCAAAA 
60351 CATCATATGT ACCCAATAAA TATATACAAC TACTACATAC CCACAAAAAT 
604 01 TAAAAGTAAA AAAAAAAATT AAGAAAATAA AAGAACAAAA GTAGATGTAT 
604 51 TCTACATGTC TCCATATTGT AAAACTAGAA CCAGTCAGTT AACTTTAGAG 
60501 GAAGGGGATT GTGGACTTGA TATAAAGACA ACTTTATAAT ATGCAGAGCA 
60551 GCCTAATCCT ACAATTGTCA AAAAGTATAG TGGATTCTTT ATTTATTTGT 
60601 CCATGATATT ATAGAGGTCA TTTCTGCTTT AACAAGTAGG TGGGAGATAG 
60651 CTAGGTAGGA TATATTTTGT TCTTATTTTT TATTTTAAAA TATTGGGCTG 
60701 TGGCTGGACA TGGTGGCTGA AACCTGTAAT CTCAGCACTT TGGGAGGCTG 
60751 AGGCAGGCAG ATCACCTCAG GTTAGGACTT TTCGAGACCA GCTTGGCCAA 
60801 TATGGTGAAA CCCCATCCCT ACCAAAAATA CAAAAATTAG CCAGTTGTGG 
60851 TGGCATGCAC TGTAGTCTCA GCTCCTTGGG AGGCTGAGGC AGGAGAATTG 
60901 CTTGAACATA GGAGGTGGAG GTTGCAGTGA ACTGAGATTA CGCCACTGCA 
60951 CTCCAGACTG GGAAACAGAG TGAGACTCTG TTTTATATAT ATATATATAT 
61001 ACACACACGT ACATATACAT GTATATATAT ACACATTATT ATTGAAAGCA 
61051 GCCAAAGAAA AATAACACAT TATATATAGA GAAAGAGCAA ATGATGAGTG 
61101 ACTTTATATG TATATATATG TGTGTGTGTA TATATATAAT GTGTATATAT 
61151 ATACATATAT ATATATAGGT TAAGAACCTT CAGCACATGT ATACCTATGT 
61201 AACAAACCTG CATGTTCAGC ACATGTATCC CAGAACTTAA AGTGAAAAAA 
61251 AAAAAAAAGA ACCTTCTGCA TGCCAGTAAC TGTGCTAAGT GATTAGGATG 
61301 CAATGGTAAT AAAAACAAAG TCCCTCTCCT TAAAGAATTT TCTATTTAGA 
61351 AGGGAAAACT GGTAAATAAA AAATAAATAT ATAAATTACA ATTTGTGAAA 
61401 AGTGCTACAC ATGAAAGAGT GCTGAGACAG ACATCAATGG ATAAACTTTA 
614 51 GATTGAGAAG GGCTCTGACA AAGCAACATT TAAGGTGCAA CCTGAGAGAA 
61501 TAGAAGTTAA ACAGGCAGAT ATTGGTGAAA GAGCAGTCTA GGCAGAGGGA 
61551 ACATCATTTG CAAAGGCCCA GGGTAAAGAA GATCCTGGTA AGGAAATGAC 
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61601 AGTGGAAGAA GGTTAGTGTA GCAGGACTGT GGCTAGGGCG GAGAGGCAGG 
61651 GAAGTAGTTT AGAATTTCAA TGCAATAGGA AATATGGAAG ATTGAAGGCA 
61701 GTTTTGCATT AT AAAAT AAT ATGATTGCTA TTTTAAAGCT ACTTTATCTA 
61751 AGGATGGAAG ATTCTTAAAT AAACTTGTGT ATACTTGGAC CACACCACCA 
61801 TGAGCAGCAG CTGCTCTAAT TCAGAGCAGT CCTCCTGCCA AACGCTGTGT 
61851 GAGACAAAGC TCTGATTCAT AAAGGGGCAT TTTTCTCTGG GAGAAAACCA 
61901 GTGATCCATC TGTAGAAGTA CCTGAGTCTA AGGGGAGACG AAGCAGCAAA 
61951 AGAAATTGGC TTGTGAGGAC AGGGACATTG TAAGAATGAA AAGAGGAAGG 
62001 GAGGTGCTGA GCCCTTTTTC TTTTTTCTTT TTCATTTTTC TTTTTTTTTT 
62051 TTTTTGAGAC GGAGTCTTGC TTTGTCGCCC AGGCTGGAGT GCAGTGGCGT 
62101 AATCTCAGCT CAGTGCAACC TCCGGCTCCC GGGTTAAAGC GATTCTCCTG 
62151 CCTCAGCCTC CCAAGTAGCT GGGACTACAG GCCCTTTTTC TTAATCCACA 
62201 ACCTTCAGTT GGATTTTGCA AATGAGTCTG TCTTCACTGT TTCCATTCAG 
62251 TGGCTGGAGA CAACTTGGAA GAGAATCTCA GAAATAACTC TGGCTGCTCA 
62301 CCCAGTTGTT TGTAAATTTT TATTGAGACT CTACTGTGTG CCAGGCTGTA 
62351 CCAGGCACTC AGATATGACA GTGAATGAGA TAGGCAACAT CTTTGCCATT 
62401 GGAGAGCCTA CACTGAAGTG GACATGAGGG AGTTGAAAGC AACTCTTATA 
62451 GGAAATCATG GTAAAGACGT CCAAGAGAAG AAAGATGAAG GGCAAACACA 
62501 TGCACGGATG CCAAACATCT ATCAGAGAGA AAGGAATTTT CAGACCTGAC 
62551 CTGAATGATG AAAGGAGGTT TTTGGAAAGG AAAAT AGAAG GGAAGGACAA 
62601 GGGAAATTAT CTGGGCAGCA ATATTTATCT GCTGTGGTGC TTCACTCTCT 
62651 CTCTAATCCT TTTCCACCCC AGCCCCAAAT TTGAAAGGAT TGCAGGGAGC 
62701 TCCTGCTGGA GTCATTTCTG GTATTAAAAA TGTACAGAAA GGAAAGCTTT 
62751 GGTTCTGAGT TTGCAGGCTT CCCTGTCTTT CATTCCTATT GTAGAAAGCA 
62801 GCTTATATAA AAAGATGTGC TGTGTGGCCC TTTGAGCTGC TGTGATTGTG 
62851 TTAGGACCCC ACTGGATGGT ATTCGCATGA ATTAATCTAC TGTAGCATCT 
62901 CTACAAATCA AGAGGCTGGC TTCTGTTTGA AATGTCCCAA GGCTTTGTGC 
62951 ACAGGGCAAG CTAAATGTCT CCCTACAGTG AGACTGAAAA TGCCTTGGGT 
63001 GCCCTTGTCG ATAGGATCTG ATATATAGAT GCATGTCTAC AATTGCACAG 
63051 TGGCTGCTGG CAACATTTAT TACAATCTGA ATGTGAAATG GCTATTCTGT 
63101 TCAAGGATTC TGATAAAAAG TATCAGCCAC AGTAGATGTA TAAGGAGCCT 
63151 GGTTTCACTG CAACTGACTA CAGTTATCTG ATTTTTTTTT TCTAGTTCAT 
63201 TTTTAGTCTG TGGAGCAAAC AGAGATTTCC TCCCCAAATG ATGTCCTTTC 
63251 TCAGTCACCA GGGTGTGGTT ATTTGGTTTT ATGTAGAGGA GATAGAAACC 
63301 AATCAGTCTA AATCATATTC TGTTGAAATC AGAACCAAAG GATCCACAAT 
63351 CTGGCTCCAA TCTAACTTTC CAGCCTCAAC TCCTACCTGT TCTTTGTTAC 
63401 TCTTACCCCT CTAAACCACT TGTGGGATCC TGAACTTGTA ACCTGTGCTC 
63451 AGACTGGTGC TTTTGCACTT CTCTGATGGG AAAGATTTCT CTCATCTTTT 
63501 ATGATTCAGC TGAAGTTTCA ATGCTTCTGA AATTTTTTCC TGCTCCTGCT 
63551 GGAGAGCTTG TTTCTTCTGG ATTCCCATAG GTCAGGTCCT GTGTTTGGCA 
63601 TTGGGATACA AAGCCAAGTA ACATAGCATC CATATTCTCA AATCCTCACA 
63651 ATTTGGTAGG AATATAGACA AGTAAATACA CCCTGTGCAA CCTTTTGTAA 
63701 CAGAGGTATA AAAGGGTATG AAATAAAGAA TTTAATCAAA TCAAATTGAA 
63751 TATGGGCTTC AACTCTGAGA TCTTCTTCCA TGATGAGGTT CCCAGTTTAC 
63801 TCTAGTGAGG TCATGATTCC ATACTGGCAC TCTTCTAGGC ACATAAGGCT 
63851 CTATCCTATT ATTAAATAAA GATTATTACC ATTCT r CACTG CAAGCAGCAG 
63901 CAACCTGACA CCATCATCAT CAT AAAAT AA GTAAAACNAG AGTTAATTAA 
63951 GTGTGAACTT TCTAAACCAA CATTGTATGA GATAATTACT CATAAAAATG 
64001 ATTCTTCACT TTCCAAAGGT GCCTCTAAAT ACTAAGATTT CAGTTACAAT 
64051 AAAACTTAGA TCCAATTTAC AGATATTAAA TTTGGTCCAT TTTCCAAGAA 
64101 TATTTTCTTT TCTCATAAAA TAAAAAAAGT ATGTGAGAAT ATTAGCACAA 
64151 AGGGGTTGCA AAATAAATTT TATTTATCCA GATGTGAGAT AAGAGGCACA 
64201 TGCGTCTTTT TTCTTGTTTT ACTGCACTGG TTAGGACCTC TAGTATGTTG 
64251 AATAAAAGTG GTAAGAATGG ACATTCTTGC TTTGTTTCCA GTTTGCTTTA 
64301 ATATGTTTTC TGTCAGTTTT TCATAGATGC CTTTTATCAG ACTGATTAAT 
64 351 TCAGTCTATT ATTATTTCAG TATGTTATTC AGTTTATTAT TTCATAATAA 
64 401 TTTTTTAAAC CATGAATGAG TTTGAATTTT GTCATTCCTT TATGTATCTG 
64451 TTGAAATGAT CATATCGTTT TGCTTTCTAA AGCTTCTAAT ATGGTTTAAT 
64 501 CACATTTATT GATTTTTCAA ATGTG AAGC A AATTTAAATT CATGGCATAA 
64551 ATCCTACTTG GTCATCGATG TGTTATCCTT TTTGTATGCT TCTGGGTTCA 
64 601 ATCTGATACT ATTTTGTTAA GTATTTGTGG TGTCTTTTCA TGAGAGATGT 
64 651 TGGTCTGCAA TTTTTTTTTC TTGTAAGGTT TTTGTAAGGG TTTAAGAAAG 
64701 CAAGGTCAGG TAAGCTTCAC AAAGT AAGTC AAGAAGTATT TTCACCTTTA 
64751 TCTTCTGAAA GAATTTATGC AACGTTGAAA TTATTTGTTT CAGAGATGGT 
64801 CAACAGAATA TACCAGAGAA ACTATTTGGA CTTAGAGCTT CCTTGGGGGA 
64851 AGGTTTTTGA TAAATAATGC AATTTCTTTA ATACATAGTA CTTATATTTT 
64 901 CTATCTTACC TTGTGACAAT TCTGATGAAT TGTGTTTTTC AAGAAGTTTG 
64951 CCCATGTCAT CTGAGTTGTT AAACTTACTA CAACAAAGTC TTTGATAATA 
65001 TTCCTATATT AGCCTTTGAA TGTCTATAAG ATCTGTCCTG ATGTTCCCTC 
65051 TCTCACTTTT TTAAAGAAGT CTTGCTAGAG GTTTACCAAT TTTATTTTGT 
65101 TTTATTTTAT TTTATTTTTT CTTATTTGAG ACAGAGTCTC GCTTTGTCGC 
65151 CCAGGTTGGA GTGCAGTGGC TCGATCTCGG CTCACTGCAA GCTCTGCCTC 
65201 CCAGGTTCAC GCCATTCTCC TGCCTCAGCC TCCCGAGCAG CTGGGACTAC 
65251 AGGCACCAGC CACCATGCCC GGCTAATTTT TTGTATTTTT AGTAGAGACG 
65301 GGGTTTCACC ACGTTAGCCA GGATGGTCTC GATCTCCTGA CCTTGTGATC 
65351 CACCTGCCTC GGCCTCCCAA AGTGCTGGGA TTACAGGCGT GAGCCACCGC 
65401 GCCTGGCCGA GGTTTACCAA GTTTATTAAT CTTTTCAAAG GACTACATTT 
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65451 TGGCTTTGAT AATTTTTCCT ATTTTTTATC TACATTATAC TGATTCCAAT 
65501 TCTTATCTTT ATTCTTTTCT TCCTTCTCTT CACTTTGGGT TTAATTTGTT 
65551 C ATT TTTTTT TCTGGCTTCT TGAGATAGAA GCTGAGATCA TTGATTTTGA 
65601 ACCTTTCTTC TTTTCTAAAT AAGTGCATTT AAACTTACAC ATTTCCCTTT 
65651 AAGCACTGCC TTAGCTGTAT CTCACAAATT TTGATATTGT CTTTTCATTG 
65701 TCTTTTATTC AATATATTCT AATTTTTCTT GTGATTTCTT CTTTGGCCCA 
65751 TAGGCTGTTT AGAAATATGT AGTTAGTTTC CAAATATTCG AAGACTTTCA 
65801 CAGATACCTT ACTATTATTG ATTTCTAATT TAATTCTGCT ACAATCCAAG 
65851 TATATACATT ATAAAGTTTC AGCCTTTTGA AATGTATTAA GAATATTACC 
65901 AGAGATAAGA AGATAAGAAT ATTACCAGCG ATAAGTAGGG ATATTTCATA 
65951 AATAATAGAC GAATTGATTC ATCAAGAATA TACAACAATC ATAAATGTGT 
66001 ATGTGTCTAA TAACAGAGTC TCAAATTATA TGAAACAAAA CTGACAGAAC 
66051 TAAAGAGAGA AATGGCCAAT CCCACAATCT TTATCTTTAT CAGGTGATTT 
66101 ATCTTGGTGA ACATTCCTTG TGCTCTTGAA AAGAAAGTGT ATTCTGTAGT 
66151 CATTGGGTAT AAAATTCTAT ATATGACAAT GAGGTGATTG ATAAAATTAT 
66201 TTAGATTGTC TATATCCTAA GTTTTGTAGA ATTATTTCAT GAATTACTAT 
66251 GACAAGGATG TTAACAACCT ACAGCTATGA TTGTGGAATT GGCTATTTCT 
66301 CTCTTTAGTT CTGTCAGTTT TGTTCCATGT AATTTGAAAC TCTGTTATTA 
66351 AACACATACA TTCATGATTG TTGTATCTTC CTGATGAATT GGTTCCGTTA 
664 01 TTATTTATGC AATGTCCCTA TTTATCTCTG GTCATATTCT TTATCTTGAA 
664 51 GTCTTTTTAA CTGATATGAA TGTAGCCACT TCATCCTTTT TATGCTTACC 
66501 ATTTGCATAG TTTATATTTT TCCATTATCT TATATTCACA CTATTTATCC 
66551 CTTTATACTT AAGTCCATGT CTTGTAGACA GTATGCAGTT AATTGTGTCT 
66601 TGATTATTTT TACTCCTTTC TGACAATTTC TGCCTTTCCA TATAATATGC 
66651 TTATCAATAC AGTTGGAGTT AAATCTACCG TCTTGTTATT TGTCACATCT 
66701 CCCATCTTTT GTTGTTGTTC CTCATTTCCT TGTTTATTAC CTTCTTTTCA 
66751 GTTATTTTTT TTTTGTATTC CATTTTAATT CCTCAATTGG CTTTATAGCT 
66801 ATATATCTTT GTATTATTTT TTATTGTTTG CTCTAGGGAT AGCAATATGT 
66851 ATACTTACCA CAGACAATTT AGAAATCATA TTGTACCACT TCACATAAAA 
66901 TAGAAGAAGC TTGCAGCAGT CTATGTCCCT TTACACTCCC ATTCTTTGTG 
66951 CTATTGTTTC CGTATGTATT ACATCACGTA CATTGTAAAA TCCACAATAG 
67001 AGTGTTATAA TCTTTTTCCA AATCCTTGTG TGAATTAAAA ATTTTATGAG 
67051 TAGAAAAATA CATATAACAT TTTATTCTTA CCT AC AT ACT TACCAGTTCT 
67101 GCTTTCTTTT CATTCTTACC TGTTTCAGTC TTATCTGTAA ACCCGTTTTC 
67151 ATTTGGTGTC ATTTCCATTA GCATTTCAGT GCAGAACTTC TAGCAACATA 
67201 TTCTCTATTT CCATGTATCT TAAAATATCT TTATTTTGCC TTCGTTTTTG 
67251 AAATATATTT TAATTGGACA TAGAAATCTA GGTTGGCAGT TTTCTCTTAT 
67 301 ACTCTTGGGT TTCATTGTCT TCTGATTTCT GTTGTTTATG AGGAAAAGTC 
67351 ATTGATTATT TGCTCTTTCT CTATACACAA TGTATTATTT TTCTTTGGCT 
67 401 GTTTCAAGAT ATTTTTCTCT TTATCTGTGG TTATCAACAC TTTGATTATG 
67451 ATGGCCTAAG TGGTATTATT GTTGTTTGTA TTTATTCCAC TTGGTGTTCC 
67 501 TTGAGCTTCT AACTTCTGTG AGCTTTTTTT TTCTCAGCGA ATTTGGAAAA 
67551 ATTTAAGCCA ATTATTATAT AATTTTTCTT CTCCATTCTT TCTACTCTCT 
67 601 TTGGAACTCC AGTTGTACAT AGGTTAGACT GCATGACGTT GTCCCATAGA 
67 651 TCACTAAGAC TCTGTTCATT TTTCAATTTT TTTCTCTATG TTCTTCAGAT 
67701 TGGACAATTT ATCTTGATCT CTATTAATGT TCACTTATCC TTTATTATGC 
67751 CACCTTCAAT CTGATATTAA GGCCATTCAG ATCTAGAATT TCTATTAGGT 
67801 TATTATTTAT AGTATTAATT TCTCTGCTAA GATTTTTTGT CTGTTCATTC 
67851 ATTATGACCA CAATATTAGG TTCTTAAACA TATTTTAATA GCTGCTTTCA 
67 901 AGTCCTTGTC AGTTAATTCC ATCTGAGTCA TCTTGGGGTT ATTTTCTATT 
67951 GAGTGATCTT TACCTTATCT GTCGGTCACA TTTTTTTCTG TTTCTTCACA 
68001 TGTCTAGTAA TTATTTATTG TTTGCTGTAT ATTGAAATGA AATATTATAA 
68051 ACAGTATCAA TTACATTATC TTCCTTTTAA GGGTATTGAG TTTTGTTCTG 
68101 GAAGTAGTTA AATTACTAGT AGAACTTTTT GTTCCTGTCA AACTTGATCT 
68151 TATTCTTTGT TACAGTGAGC CTATTTTAGT TTTAAAGTTA GTCCTAGGGT 
68201 ACAACTCTTG CTCTATTGTA TGCTCCTTAC TTCTATCACA TTTATTTCTA 
68251 TTGCCTGAGA TAGTCAATGA GTTCTCACCT GAGCAGGAAC TGCAACATTT 
68301 CTTGACATGG TCTTACCTAT GTATTCATCA TTCATCTCTC AGGCCTGTAA 
68351 GAAGAGATCT CTGTTGGGTC CTGTGGAATC TTGCTTGCAC TTGGACAGCT 
68401 CAGCCTTCAG CCAAAGACTT GCAGGAAAAC CCCATAGAAA CATCTGGGCC 
684 51 CTCTCAATAT TTGATGTTTA GGAAGCTAAA CGTCAAGTAT AGCCTCCTTT 
68501 TCTAGGGACC CTATCTTGTG AATTTCACTC ACCTTAACAA CTCAGAACTC 
68551 TTATCTTCTG CCTTCTCAGG GGAGCTAAAC TGTCACTTTC TGTGGGCTCC 
68601 ATCTTCCTGC TCCACAATAG GAAAGTATCT GCAGAGAAAA GGCTGGACAA 
68651 TTGTGTAGTA ATTGCTTCAC GCATTTCCCT TCTCTCAAAG ATTGTAAGTT 
68701 TGCACTGTTT GCTGTTCAAT ACCTGAAAAT GATTTCTACA AATTGTTTTT 
68751 CCAGTTTTAT GATTGTTTTC AATGGGAGAT CATTTCTAGT ACCAGTTCCT 
68801 CCATCATGGC CAGAGGTACA AGTTCAACTT GGATCATTTT AAAAATACAA 
68851 ACTGGGGCAT GTCACTTCCT GCCCCAAACC CCTTGGTAGC TTTCCATTGC 
68901 TCTTAGAATA ACTTTGTGAT CTACAACATC TTCTTCAAGG CCCCGCATGA 
68951 TACAAATTCT GGCTATTTCT CTAGTTTCTT ATTGCACCAC CTTGTCCCTC 
69001 ATCCACCTTT TTTTTAGTCT TCTCTCTTTC TTTGAACTTC TACCACCAGG 
69051 TTTTTTCACA CGTTCTTCTT TCCCCATTAA CAATGATCCA CCATTCTCTT 
69101 TCTTTATCCA CTGTTACTCA TCCTCATAAC TGAAACATCA TTTCCTAAGG 
69151 ATGGCCATTC CTGGTTCAGT CAGTCTATAT TTCATCCCCC ATCACATACT 
69201 CTTGTTTTAC CCTATATTTT TCCTTCAAAG CACTTATTTA AGTTGTAATT 
69251 ATGTGTTGTT TATTTTATGT CTGTCTGCCC TCACAGAATC CACAGTCCAG 
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69301 GAGAACAGAA ATCCTGCCTC TTTTATTTAT ACCACATCCA CAGTATTATT 
69351 AGTGCCTGTC ACCTAGTAGG TATGCAGTAT GTACCTATTG AATAAATGAA 
69401 TTGACTTCTG TCTTTTAGAT CGTCTACTCA TTTTATCATT GATGACAAAC 
69451 ATAATACCTT ACATTCGTGT AGTCTTTTTC ACTCCTCAAA GAGGATTTTC 
69501 TGCATAGCTC CTCTGAGCCT CACAAAACCC TTTAAGGAAG ATTGTGAATA 
69551 TTATCAGATA AAGATTGTGA GACACAGAAA AGCCAGATGA TTTGGCAATG 
69601 CTCATAGTAC CAGAGGCAGA AATACAGCTA GAACAGTCTC CTGGCCTCTA 
69651 ATCAGGAGTT CTTTCCAGAA CACTGCTTCA TCTTCCATTC TCTTGGGTTC 
69701 TTTCTATCCT TACTTTATAG GGCAAAATGT GTGCAAAGTA TAATCCCTCT 
69751 TTTGCAATGT GTTTTTAGTT TTTCAGATTG GAATCATGTA GGCTTTTTAT 
69801 GCCCTTAATA AATATCAGTG AGCACAAAGG AAGTCCTGTG AGGGCTTATA 
69851 ATCATTTTGC TCCCATTAAT TCCAACACTG AGCAGTTTCC CCATTTCCAT 
69901 TCTTGGCCTT GTGAAGCTCT TTGCTATCCC TGTTAAAATC TAAAGTTGCT 
69951 TGAACCTTCT TATTGCAAAA ATGCATCTTA AACATTCTAA TACCTCTTTT 
70001 TTAAAAAACC AATAAAGACT ACGTCAAAAA TCAGCCATCA ATCGAGAAGC 
70051 CCTGCAGTCA TTTGTGTGCT GTTGTCCCTA AGTAGAAGTG AATGTGCTGA 
70101 GCTCTGCATT CCCCACCTAG CTCCTCTGTG ATCAGGGTGG ACATTCCCAG 
70151 GACAACTGGG CCGAGGCTGG AAACACCATC TGAATGTCTG ACCACACAAA 
70201 GTTGAGTGGC TGATCCAGGT TTAACCTTGA CCTCATCAGC ACCACCTTCT 
70251 AAGCAACACT TTGGCTCAGA AGCCCAGTTA TTTATTCCAA GGGATGATTG 
70301 AATGCAGTGC TAGTGTTTCT TCAGGGCTTT TGAACTCATT TATTTATCCA 
70351 GTCATTTATA AAAGATGAAG AGGAGAACAA GGTAGGCCAA AGTGGCTTTG 
70401 TACT ATT AAA GGCTGCTTGA TTTCTAAGTA CATGTTCTTT GCCACCTTTC 
70451 TGCCATTCCA CATTCTAGAA GCCATGGGTA AGTCAGCACA GGGATCTTAA 
70501 CATGATAACA TTGGTTTTAG GAGGTCTCGT GCATAATGGA CCAGACTTAG 
70551 AGCACAATGC TGTAAGGTAG TGATTTAGGT GAGCAGCAGA TTCTGGCTTT 
70601 AGGAGTTTAT TATCAGATGC TTTTTAAACG ACTTGTGGCC CAGGATCCCT 
70651 GCACCCATGG GAAGCATTGT AGCCTTAGAA CTCTGGGAAT TCTGAATATA 
70701 ATTCCTGAAT CAATCGTAAG GATGCATATC TGATGCTTAG TGCAAACCAA 
70751 GAGGCAGAAT ATTTGCAGGC AGTGTATCCT TGAAAAACAA ATCTAGGTCA 
70801 TTTTCCTGCC ATGCTTCAAG CTTACTTTTC CATCCTTCCT GATGGTAGTA 
70851 CTAACTACAT TTGTAGACCA TTTACGTGGT CAACACTGTG CTAAGCTGTT 
70901 AGCTTCATTC TCTATGAGAC AGGCACTCTT AGCCCAACTT TACAATTGGG 
70951 AAAACTGAGA CTCAATGAGA TAAAGTAAAT TCTTTACAGT CATTATGCTA 
71001 GTCCATGAAG GAGCTGCGAT TTGCAACTAA ATCTATCTGA TTCCACAGTC 
71051 TTTGCTTTTA AC C AG AGGTT AGCAAACTAC TTCTGTAAAG GGAAGACAGT 
71101 AGTTATCTTA ATCTTTGTGG GCAACATAGG GTCTCTGTAA CGTATTCTTC 
71151 TTTCTGTCAC AATCTTCTGG AATGTAAAAA ACATTTAAAA TTTACAAACC 
71201 TTACAAGAAC AGCTCATGGG CTAAATCGGA CCTGGATTTA GTCTGTGAAT 
71251 CATAGTTTGC TGACCCCGCT TTTTAACCAG TATGTACCCT CCTTCTCGGG 
71301 ATGTGAAAAA TTAGTGCAAT TGCAATGGAA AATAGCAAGA AAATGGTAAG 
71351 GGCCTGGAAG AGGCAGCAGG ATTACATCAG GTGCTATCCC TGCTCTGGTG 
71401 AGATGAAACT GGGGATCATT GAACCACCTG GCATTTGTTA AAGAGTTCTG 
71451 CTTTCCCTCT GAGATTCTTT CAGGAACCTC ACACCTCTAG CAGCCCGGAG 
71501 AACCGTGGGC TGCAAGGAAA TGCCTCCTCA AAGGAGTAGA AAACCTGCAG 
71551 GATAGAAATC ATCACATCTG TCTGGCTTTT CTCAACCTTT CTCTTCTGCA 
71601 CTTTCTTGGA TATAATCAAA GCACTACCAG GAACTCCAGA GTCGGCACCT 
71651 TTTCATTTTT GTGTTTTCAT TTAATTATTT CTCAGCTGCT AAGTGTTTGA 
71701 CTGTTTAAGG GACTCTAGTG GTAAATATTT GTCTTTAGCC TGGCAGAAGC 
71751 TGTGGTTTCC TTTGATGAGC TCACACGGTG TGGCTTTTAA GATGCTGCTG 
71801 ACCAGGACAG CTGACTGTCC CCAGTGGGTG CAGTCCCCAG CAGTGGGCTG 
71851 GACCCCTTCC AGAAAGCGCT GCTGGGCCAA GAGGCTTCCT CCAACTTCCC 
71901 GCTGCCCCCA TCTAACCAAC ACCTCAGTCT CTTCTCCACC TGCTTCCCTG 
71951 CCCTCTTCCT TTCCCTCGCA GACACTTTCT TCTGCCTGGC AAAAGGAATC 
72001 TTGTTTCCAT GGAAGCCTCA TTAAATCTGC ATCTTGCTCA GTTTGGGTTT 
72051 GATCACGGCT GCCAGAAGTA TTTTTAGCCC ATGCAGTTGC GTAATGAGAT 
72101 AGAGATTGGG GAAAGGGGGA GGTGACTGTA TAGGCAGAGG GTTTTTTTAA 
72151 AAAAAAGTGA GAAAGAGAAG GAAAACCTCT AAAGAAAAGA GTTTTATGGA 
72201 ATTGGAAGAA GGATGGAGCA CCTCTTTTGG GAGCATGAGG CTGGTGTTCT 
72251 CTGGTTAGCT CTTCCCACTG GAAGCCCATG GACACTTGCC ATAATACCTG 
72301 TCCTGGTCAC ATGTCAGGGG AACCTCTGAT CTCCCTTTCC ATGAGCTTAG 
72351 TTGGCCCAGC CAGGGTGACA CTTATGCTAG GGAGTGTGAT TGATGTTGCT 
72401 GCTTACAGAT TTCCCCTCCC ACAGACCTGA TGGGGCAGCC AGGATAGTGG 
72451 CAGAGAAGAA GACAGAGCAA TAGCAGGAAA GAGAGGACAA CACTAACACA 
72501 TTGGAGGTTT ATGTTCAAAG ACGGGATCTA GGGGGTCAGA GAAAGCACAC 
72551 CTACCATGTA ATTGGTGCTG GAATCTGATG CCAAGTGCAC CCTTGGCTTC 
72601 TGAGGTTCTG AGAACTCTTG CTTGTGCTTT TCAGCCAGAC TATGCCCTCA 
72651 CCTGCCCCTG TACTTTAAAG AGCTCTTTAG GCTGGAGTGG TTGTTTGCAT 
72701 TGGATTGTTG GAGTGTGTGT GCATGTTGTT GTGTTCTTGT ATTACAAGAC 
727 51 AAAGAG AT T A AAAAAAAACC ACATGCAGCT GTCACAGCTA ATGTTTATTG 
72801 AACTTTTACT ATGCCACATG GTGTTTTAAG CATTCTATAT GTGTTAACTC 
72851 ATTTTCCCTA ATTCTATGGA CTAGACACTT AAACAGTCTC CATTGTACAA 
72901 ACAAGGAAAC TGAGGCACAG AGAGGTTGGG AAACTCATTT GAGGTCCTCC 
72951 AGCTAATTAA TAGTGGAGCC AGGTTTTGTA CCCAGACAAC CTGATTTGAG 
73001 AATCTGCAGT CCTAGATTAG TAACGTGTTG TTGGCCTGTC ACACATTTTA 
73051 AATGACATTC TGTACACAGA ACCATTTATA GTAACTTTGT ATTGTTGAGC 
73101 TGAAAGCAGT CTGCAGATGT GCTGCTGGGA TTTC ATT C AT CTTCAAAGAG 
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73151 GTGTTTTTTT TTTTTTTTAA AGGAAAATGC TTTTCTGAGG GTGGTATCTA 
73201 AATTCATAAA AATCTTTACG ATCAAGATTT TCACAAATTT CATTCTGACT 
73251 CTGTTGCATT GCCCTTCTTC CCATATTCCC AGTTAGTTTG TATTGATTGC 
73301 TGCATCTCCC TTGAGCCCAT GGTCCCCCAC AACATTTCTT GCAGAACTGT 
73351 GTCCTGCCTT CACACTGTCA GGCAGCAGGA GCCTCTCTAG CGGCCAGCCC 
7 34 01 ACAGTCCTGC AGCTCCTTCC TCAGGACGTT TAATTTCCCA CATTTCTATG 
73451 CAGTTACCTC ACAGAAGGAT GGCTACGAGG GCCTCACTTG GCTTGGCAAG 
73501 TTGGTCCCCT TTTTACTCAC AAGACTCTGT TTATCTCTTT GTTTATCTTT 
73551 GTTTATCTCT TTGTTGACCT GCCCCTCTTC AAGGCCTCAG TTTTCTCTGA 
73601 AGTTTACAGC TTCCCTCCTC ATCCCGCAAA AGACCAAAGT GGAAAAGATG 
7 3651 AAACCAGAAT CCACTGCAAG CCCCACCTGC CACAGCCTCT CCTCTAAATG 
73701 CATTCTCTGT TGTGTTTAGG ACTTGAGAAT GAAGAGGGAC ATGAATTGAG 
73751 GATTTGTTTA TTATTCTTTA CAATATCCCT GTGAGCTGAG TACTGTAAAT 
73801 ACCCCCATTT GATACATGAG TAAACTGAGG TGTGGAGTGA TAGAGGAATT 
7 3851 TGCTCAAGGT CACATAACTA GTAAGTGGGT GGAGCTGTGA TGTGAAACTG 
73901 GGCAGTCTGA TTCTGGGACC TGTGCTCTTA ATCACCAATC TATATTGCCT 
73951 CCTACTTGAA AACATCCAGG GAAAATGTTG AGATAGATCA GCTGAAATCT 
74001 TCTTGCACAG TAAAGCAGGG GCCACCTGTC CTGGAGTTAC ATTCATCTTG 
74051 TTCATTGTCA ACGATTTGTG TTCAGTGACA CCCTCTTCAG CCCAAGAACT 
74101 TACCTGGGTG CTGTGACAAT TGGACATGAC TAGGAACAAC CAGTGACATT 
74151 GTAGCCCATC CAAACACAGG GTAGGAAGTG GATGCTTGTC ACTCTCTTTT 
74201 GGTTATAAGA AGCAGGAACC CAGTAAAGGC ACCTTTTATA TATCTATAAA 
74251 GTTGAATATA TAAGATATAT GGGGGCCAGG CACAGTGGCT CACACCTGTA 
74301 ATCCGAACAT TTTGGGAGCC CAAAGCAGGT GGATCACCTG AGGTCAGGAG 
74351 TTCAAGACCA GCCTGACCAA CATGGTGAAA CCCCATCTTT ACTAAAAATA 
7 4401 CAAAAATTAG CTGGGCGTGG TGGCACACAC CTGTAGTCCC AGCTACTTGG 
74 451 GAGGCTGAGG CAGGATACTT GCTTGAACCC GGGAGGTGGA GGTTGCAGTG 
7 4501 AGCAGAGATT GCGCCACTGC ACTCCAGCCT GGGTGACAGA GCGAGATTCC 
74 551 ACCTCAACGA AAAAAAAAAA GAAGATATAT GGGTATGTGT AGAACTCACA 
74 601 GAAGGGCAAA CAGGCCTTAA CAGGTGCTGA AAACAGGAAC TGGGAAGTTG 
74 651 CCAGTACCTT CCTGTCTTTT CCCCTGGAAC CAAACGGTTT CTTACTTGCT 
74701 TCTCTCTGCA CCTCTGTCTC ATTTCCCTCT CTCTTCAGAT GATTTTTCAT 
74751 TGTTGCATCA CACACATAGA AAAATCAGGA TCCACCCTCC CAAGTTTACA 
74801 TATCGTTGTT TCAGGCAGCC ATAGTATCCT TAAAACTCCA CATTCCAGGG 
74851 AGAAAGCTTG GGTCAAGGAT TCAGCCAAAG GGCAGCGAAA TGGAGTAAAG 
74 901 ATGCAACTGC CAGGTCTATG GGCAGCAAGG AGGCCGGGAA GGAAGCCGCT 
74 951 GTTGTGGTCC AAGTGACAAT TCAACAGCTC AAAGCATAAG TAAGTTGTGT 
75001 GCTTTTCACA GATGGAGAAA CTGAGGC AC A GAAGGAACCT GGCTGGGGTC 
75051 CAGGTCTCTG GCCTTTGTGT CAATGCTAGG TCACTGGATG TGGCGTCTGA 
75101 TTTCTACAGG AAATGTGGTT TCTCTACTTT GTCCCAGAGC CCACTCAGAG 
75151 CACTGGCTGG CCAGGGGGTC CTAGGGCCCT CTTAGGATAG TCTCAGGCCA 
75201 ACAGCCCCAG GACAGAAGCA ACCAAAGTGA AGTTATGAAA GAAAGCTCTT 
75251 TGCTGATCTG TCAATGGCAC CCTTGTAGAG CCAATACTTA GAACACCTGG 
75301 ATTTGAATAC TCATCTCCAA AACCTGTGTT CTTTCTACCA CGTGACAAGC 
75351 CCTTGTAAAC CTCACAACGT CTCTATGAGG TGAGCGCTTG CAGATCCACA 
7 5401 CTTTAGATAA GCAAATGGAG GCTCAGAGGG TAAGCAGCTA GTTCAAGGTT 
75451 ATGCACCTGA GCCAGGATGT GGACACAGCT CTGTGTCTGA TTCCTAAGGG 
75501 CCTGTGCTTT AGCCACTTTG CAATACTGCT GCTGTCTGCT TCATTTCCTC . 
75551 ATCTGTCAGA TGGGAACGAT AATACTCAAC TCACATGGAT ACTGTATGAG 
75601 GAAAAACAGA TAAAAGAAGA GAAAGTGCTT TGAAAACATA AGCAGCCCTG 
75651 GCAGATGGGA ATTATTTTTG CTGCTGACAC ACATCCTCAG CCTTGAGGGC 
75701 TCTGCTGAGC CATACCCAGC TCAGAGCTCT GGAGGCACCT CCTCCCCATC 
75751 AACAGCAGGG GGGACATTCT GTCTTCATCC TGAGCAGGCT G AC AAACT G A 
75801 ACCCCACTCC TCCCTCAATG TCCCCATGCT GGGAAGGAGT ATAGCTCATG 
75851 CTGTGTTCTG TCTTGTTGCT GAGAGAATGC AGAACCCAGA ATTTGGGTCT 
75901 CAGCAGGTTG GGGAGAAAAG GAAATGTATT TCTTCCCCCA AGATTTCTTT 
75951 TTGAAATATT TTCATTTGTG GAATCAGATT GTGCATGCAA GTTTCTTCCA • 
7 6001 GAAATGTAAG ACGTCGTAAT GATGGGAACT GTTGGTTTTA TAATTGAAGG 
76051 ATGGGAAAGG AAACTGATAT TTATGGAGCA CCTGTTCTAT ACCAGGCAGC 
76101 TACCCAACCA TCAGCCATTG TTGCAATGTT ATGCAAGCTT TATTATCCAC 
76151 ATTTCACAGT CTGAGTCTGA CTCAGCAATG . TTGTGTTCTA TGTGCTAGTT 
76201 CCCACAGGTA GGTGGCTGCA GCGCTGGGAT TTGAACCCAT CTCCAAAGCC 
76251 TCCATCTTTC TACCACTGCC TCCCATTGGT GGGGAGGCCA TGGACTGGCT 
76301 GTCAGAGATG TCCTTTCCAG TCTAGCAGAC TAGGAAGCTG CTGGAAGCTA 
7 6351 CTTATGCAAA GGTCAGCAAG GAAGGAAACA GAGTCAGAAC TAGATGGGGC 
76401 TCCCCTGGCC ACTTTTCCAT GCTGGCCCAC ATGTCCGGCT AGCAGTCAAC 
76451 ATTGGGTCTT ATGCAGAGCC ACCTGTGTTC AATGGAAACA TCCTGGACAC 
76501 TGC AC AAACT AGTGGGAGCC TGTGAGGGAA CAGCCTGTCG GGTTCATTGA 
76551 GGTTCAGCCC AACTCATGAG CTAGGGCAGG TACCAGAGGG TGTGTTCCAC 
76601 CCAAATGGGG CAGGTAGGCA GGGGACACAG GCTCCATTTT CATGACCAAA 
76651 GACTGAGCAG AGAGGCTCTC TGAGCAGTGG CAGAATGGGA AGTGTCAAGA 
76701 AGCTTTGTTT GACAATTGAG TCAAGAGGAC AGAAAAGACA GAAAGCAGAC 
7 6751 ATCAGAGTTG GGAAGGCTCA CCCCAGCTCC TTGACAAAGG TGCATGAGGC 
76801 CAGTTCTTGA AGCAGTGACC CTGCCTTATG TCATGTGTTT ATCAAAGCCG 
76851 GCCCATCAGC CCTGAAGTGG CCTCTGTGTT TAGAAGAGGG CCTGACATGA 
7 6901 TTCTCTGAGA AAGGATTTGA CAACAACAAA GTGTTGCCGT ATGTGTTGTC 
7 6951 TCATCCCCTC AATAGTCCTG TGAGGTATGT GAGACAGGTG TTACTCTCTC 
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77001 CACTTGGCAA ATAGGGAAAA GAGGGCCCAG AGAAGTGAAG CTGCTTTCCC 
77051 AGGACCACAC AGCTGGTAAA CAGTGTCCAT CTCAGCTGTT CTGTCTCCCA 
77101 CACCAAATAC CCTGTGCACC ACGCAAACAC AAAGACAACT GGACAACCAA 
77151 GTCATCTAAT GAGTATGCAT GCTATGGTCT CTCTCATTTT GTCTTTCAGG 
77201 GCTATACCCT AGGAGAGCTA ATCATTCTTG GTTAGATAAG AAATAGCCAA 
77251 CACTTCTGCA GCATGGTAGG CCAAAT AC C A CCAGAATAAA CTCAGACCCA 
77301 AAGAGATGCT CAGAATGTGT GGAGTTAATA CTTCACTATA CAGCTCTAAG 
77 351 GTATAAGCCT TGTCCATCTG TCACATTATG ACATGTGCTT GCTCCCACCT 
77401 CAATTCCTGA TTCCACATTA CAACAAATAC AATTTCAGGC TTTGAACTAA 
77451 CAATGCCAAT GTTTCTGAAG CCCATATTAA ATGCCAAAAT CTGAGTCAGC 
77501 TACTGGAGGT AGAGACATGA ATAAGATGGT CCATATTATT TTAGAGGATT 
77551 CTTTGGTTGC AAAGGGCAGA CACCCAGCTT GAATTCACTT TGGAGAAATT 
77601 GGGATTTTTT TGGCTTGCAT AAGCAAAGCA TGAGAAAGAA AGTTCCAGGG 
77 651 ATGATGAAAA CCAGGAATGC AAATGTCTCC AG AATTC T TT CTTTTTTCTT 
77701 TTAGGCCATC TTTTTTCTCT CAAACTGGTT CCCTCCACTG GGCTGGAGAC 
77751 GTTACTACCA GCAGCACTCA GACCCACATC TTCAGTTTAA ATGTTGGAAA 
77801 TGGACTGTCA GAGAACATTT AGGCCATTCA TTCTGTGGGA GAGATAGGCT 
77851 ATGTAAAAAG ATAGCCACTC CCATGTGAAC AATGTGGTTA GGATTAGAGG 
77901 CATGAATATA CCCCAAACCA GGGGTGTGGG AAGGAGGTTG ACACTCTAGG 
77951 TGATAATACC CAGACCTTAA GGAGCTTTCT GTCTAGAGGG AGGTATGGAC 
78001 ATGGACAAGT AATCAACAGC TACAAAGCAG AGCTGCCAGC TCTGCAACAC 
78051 AAGAGCCCTG AGAGGCATGA CAGGGGCAGG GTGGGGATCC ATGTGGGTCT 
78101 GGATTGAAGT GAGGAGGGGC ATCAGGAAAG CATTCCAGGA GAGCTGAGGG 
78151 ACACTTGAGC ACACCCTCAA AGAATGACTG GGGGTCATGA GGTATACAAG 
78201 GGAGGAAGTG CACCCGAGAC AGAAACAATC ACATAAGCAA AAATGCAGAA 
78251 GAATATGAGG ATCGGGGAAG GGCAAGTAGC TCAGTAGTGT TGGAGGCCAA 
78301 GGGACACGAA GGAAGGTGAT AAAGCCCTGA TGTTAAGGAT AGAAAAATCA 
78351 AAGTCCTTTG AAAATCATGT GGAGTTAGGA TCTCAAGAAC CCTACAAGGA 
78401 TTTCTTTAGA ATAGAATCAA AGAAAAACAA AGTTTACAGT CTGTGAGGGT 
78451 TGCATAGGAA GTAACGTGGT GAGAAATGTT GGCTTGAGAA CCACATATCC 
78501 ATAACACAAT GGTGTTTTAG AGGATTTGGG GGAAGGGAGA GAAAATCTCA 
78551 AATTGTCTCA GTAACTAATG AGCTTTCATG T AC ATT T AAA ATAGTAATAA 
78601 ATGCAATTGT GAGGATGATG GTGAGATGAG CAAAATAATC CAGTTTGTAA 
78651 TTGTAGTTAT CAGGCTGGCA TATCCTGCAG GTCACACTTC TAAACATGAC 
78701 TTCGAAAAAT CAAAGATCAG CTAAGTTTGA AGTAAGTATT GAAAGAGGGA 
78751 GATTATGTTG CCTCAAGTTA AAATAGAACG TAAAAGATGG TGATTCAAAT 
78801 GATCAAAAGC ACCAAGCTTC CCTGTTAGGA TTCAAGGGAG GGGTGCGTGG 
78851 CTCCGACACC AGATATCTGC AAAGCAATAT GAAATGAGAT CAATAGTAGA 
78901 CATTGAAAGA TTGAAACTGA TATA GGATAT TCAAGTACCA GCTTCAAGAA 
78951 AATGAAATGA GACCTAATAA AAGAGAGTAG GAGTCAAGGG GGTATACGAT 
79001 ATTAAAGAAA GTGAAGAGCC AGGGTTTGTA GGAAGGAAGG GAGAAGAGGC 
79051 AAAGAGAGCA GCTCTTTTAA CACAGGAGCT TCCTCCTTTC CCATTCTCCC 
79101 TCCTGCTAAA AGCCGAGTTT GTTTTAGCTG AAATGATTGT AAGACAAATT 
79151 TTTATTATTA AAAAAGGAGC TATTTTGTGT TGGTTTCCAT TATAAAATCA 
7 9201 GAGCTCTGCT GCCATAAAAT TAAATCCCAT AATAAAATGA GTAGAAAACG 
79251 TGATGTCCTG CAGAAAGGAA GATGGCAGCC CACTCAGTGC CATGCTGGGC 
79301 TTGACTATAT ACAAGCCGTG CATCTCCTGC TGCGAGTTGT AGCTGCTGCC 
79351 CAGCAGTGCA CATTATCGTT GCAGCTGTTT TCCTCACATT CTGAGGTTTA 
79401 TGAAATCCCT CATCCATCAA TAATTGATCT TTAGCTCTTA GTCCAGGGGT 
794 51 TGTCAACTGG CACTCCATGG ACCTTTAGAG GATTGATGGC TAGGTTTTCA 
79501 AAGATCTTTG AACCCCCTGA AATT AT AT AC AAAATACTGT GTGTGAGTAT 
79551 GTGCATTTTT CTGGTAAGAA GCACCTGAAT TATCGAAGCA GTTTGTGATC 
79601 CCCCAAAAAG CTAAGAACTA CTTCCTAGAG CAAAGGGAGA TTTTGCTACA 
79651 CTTAGAGATT TACACATTTG ACCAGGGCAG CTCACACAAG TGGGATGCGG 
79701 TTTCACATTT CATGGCAGAT CTGCTTCCAG CTATACAAAT TCATCAAGGA 
79751 AATATTGTAA TACTTCTATA TGAATCAGGA ATTCACTATA TTTAACTTAT 
79801 TTGGAATAAG AACCACTATA TATATACAAG TTTTTCCAAA AGACTGAAGG 
79851 TTCTTCCTGT GGCAGGAAGG AATATGATTA GATTCATGAA GCGCCTTTAT 
79901 GTTTATATTT CAACTCTGAA AGATAATTGT GACTTTACTA AATCAAACCT 
79951 GTATACCACG ATTAGGAAAA TGTGGACTGA TTTGGGGTTC TAGGGGTAAA 
80001 ATGTGACCCC TGTGAAGTAC CAATGCACCG TTCTTTTATC TGTGAACGGG 
80051 CACTGAGCTT CTGAAATTAA TTAGTAGGCA GGAGGACATG CGCATATGAC 
80101 GTGATAGTTT AAGTACTGAT AATTATTCAC TTGGAAGGGA AGAGAATAAA 
80151 ATTCAGAACA CAGTATTCCT TAATGGGAAA TCAACTTAGA GGAGGTAGGA 
80201 GGGAGATCAA GCAAGAATAT TTCTGGTAAA ACATGCATAA ATCAATGGTC 
80251 AGCCAATGTG TTGATCAAAG AAATTATCTT TCGGGGAAAA CAGTAGAAGG 
80301 CAATTGAAAA ACAAGCATCA GGCTGCATAA AAACAGCAAA CAAAAGTCAC 
80351 AATGGCTTGA TTGTGTGATG AGGTAATTAA TGGCTGCAGT TAGCAAAATA 
80401 TGTTCAAAAA AAAGACAGAA AGGGTAGTTA CAGGAGAAAA ACATCCCCGC 
80451 AGATCTTCAA" AATCAGAAAC AATGAAAATA ATTATTTCAA AAATTAAGAA 
80501 AAAAACTCTC TAATTTATAC CTGAATTACC TGGATAATTG GTAAAATTTC 
80551 CTGCATATAC AAATCTTGGT CCTCTGCTCC TCTCTCTATA AATAAATAGA 
80601 AATGTATGAA TCAATAGTCA GCCAATGTGT TGATCAAAGA AATT AT CTTT 
80651 TGGGGGAAAA TTGGTAGAAG CCAATTAAAA AACAAGCATC ATATTGCATG 
80701 AAAACAGCAA ACGGAAGTCA CAATGGCTCG ACGGTGTAAT GAAGCCACAC 
80751 AATATGTATT AAACACATCA TCTACACAGA TGGATTCAAA GATACCTTCT 
80801 TTGTGTCTAA GTCCCAAATC TGTGTTTCCT GGCTCTGTTC CCTCATATCT 
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80851 AGTCATTCTC CAAGTCAGCA TGCCCAACTT GAAAGTGTCA TTTTCAAAAC 
80901 CTGCTTCTTC TCTTCTGGAA GTTCTTCCTC TGCCCATTGC TCCACAATCC 
80951 CCACCTCTTT CACCCAGTAG CAAACCTTAA ATTTATCTTT TACTTTGTCT 
81001 TACTTCCCCT TCTTATATTC AAAATGTTTC TCACTTGCAT CTCTTTTCAT 
81051 TCATTTCATA AGCATTTATG AGCTCCTGTT ATGGTTTGGA AACTGTTCTT 
81101 CATGCTGGAG GTGGTCTTAT AAACAAGTAA TTTCAATTGA GTATTTAGTA 
81151 TGTTAAGTGC CATCCCAAAG GCAAACACCA GCTGTGGGAG GCTCCCCAAA 
81201 TCAGTCTAAG G AAGTTGGG A AAAGCATCTC AGAGAAGATG GTGTCTGAGA 
81251 TCGGGAGGAT GTGTGGAACT GGGCAAGGAA GAGAACAAGT AACAACATTC 
81301 TAGAAAAAGG CCTCTTTCAG CATGCTAAGA AGTTTGGAGG ACAGAGGAGT 
81351 TACCATTCAA AATTTGGAGG GAAGGAAGAG CATACTGAGG TTTGCCACTT 
81401 GAACAGATAA TTTCAGCTGT GTTGGGTGAG TGAAGTTGAG TGGGTACAAA 
814 51 TCAGGTCAGG AATATAAGTT AGGAGACTGT TACTAGAATC CAGGCCAGAG 
81501 GTGATGGTGG CCAATATATG AGAGTTTTAG CAGGGAATGA AAAAAAGAAA 
81551 ATGTGTTCAT GAGGTAGAAG TAGGTAAAAA CAACAGGATC TGGTTCCTGA 
81601 TTGGAAATGG GGGTAGCCTG GAGAGGAAGC CAGAATGCAG GCAAGAATGC 
81651 ATAGTGGTAC CATCCACTGA CATAGGGATT AAAGGAGGAG AAGAAGCTTT 
81701 GGTAAAGAAA ATAAGAAGTT CAGCTATGGA ATGTTTGAAT TTGATTTCTC 
81751 TGATGAGGAG TAGTTCTAGG TGATGATAAT GCTCAGGGTG TAGACTTGAG 
81801 AGTGGATGGG TAAAGTAAAG GTTGAGGCTA TTAAAAGGGA AAAGGTCAAG 
81851 GAACTGAGGG CCAAGGATTT ATAATAAGTT ATCTTGGGCC ACTAAAGCCA 
81901 CGCAGGATGC TGGCAGGAAA CCTATGAGCC AGGTCTTCAA TGTTGAGTCC 
81951 AGTGACTCAG GTGTCAGAAG CAGCAGGAGA AGCATTGATA GCCTGATGGG 
82001 GAAGGAGCCG TTACCTGAGA GTAGCAGAGA GAGTTATCCT AGCTGACACA 
82051 GCTCTCAGGG ATTTGCTTCT AAAGCAATCC TTAGGAAAGA AAGAGCAGTA 
82101 TCCACAGGAG ACTGGTGGGC ACTGGCTTCC CCAGAAAACC TACCTAGATG 
82151 AATTCTATTC TCAAGGGACT CCTATTTAGA TAAGGGGCTT TGTTAGTTCT 
82201 CAGAGCAACA CCAAACAGAT GTATATCTCA TTACTTGCCC CCACAACCTT 
82251 TCTGCTCTGG CCACATGGGC CTACCCACTG TCTGCTAAAT GCACTTCATA 
82301 TTTTCTTGTT TCAGTGCCTC AGTATTCATA ATCTTCTTTT CCTAATCTCT 
82351 GCCCCTCACT TACCTGAATC TTTTGTATTC TCAATGACCT GCTCCATCCC 
82401 AGCCCTTTCA AGAACCTTTA ATACCTACCA AGTGAATACT CTCTCCATTG 
82451 ATT AC AC ACT TCCTGTAGCA CCTGTTCTAT AATTATGAAA TATTACCTAT 
82501 TGTACACATA TATTTCAATC TCTTGGTGGA CAGAGAATCC AATTTATGCC 
82551 TTGTCAATTT GTAGCACATT TCCTTGCATA TGTAGATGCA CCATGAATAT 
82601 TTAGAGAACT TGTTAGTTAA TTTCCTGTTT AACATGGGCT GCAAAGTTCT 
82651 GGTCCATGCA CGTCTTTTAT AAAATAGAAA TGACGGATGG TGCATGGAGC 
82701 TTAAATTCCA TGAAGCAGAA ACATATGAGA GATGGAGCTG AATTTGTTTG 
82751 CCTGTACAGC TCTTACAGCA ATTGCTTCCA ATTTGTTTGA TTTACCTAAG 
82801 AGCTAAAATT GTAAATGGCA GCTCAAATGA TTTTTCTGTA CAT TC AG AAA 
82851 ATGAGTTTGA ATATTTGTTG GAGAGTAACT GCTTAAGACA TGAAAAAGGG 
82901 GGAGATTATA GCTTTTAACT CTTTTTTATG GCAGAGCATT AAGGAAAAAA 
82951 AAGTGCAGAT AAATGAGATC AAATGGCAAG TGTCTGAACC TGCTGGACAC 
83001 AAGTCCCGGT AGCCATTGAT AGACAGTGTT TATATGACTT CTGGGCCATC 
83051 AATAGATAGA TAAGGTACAT CAGCGGCCAA TGTTCCAGGA AGTTTGAGAA 
83101 GATAAATGGA AGTTGCACAG CAGCCTAAAA GCTTCCTTAG GAGGGCTGTG 
83151 CTCCTCCAGA GCGCCATCTG CCTGTGTCTT CCTGTTCTTC TTCTTCACAT 
83201 TAAATGCTTT TCCTTTTCTC ATTTTTATGA TGGTTATCCT AAAGATATGC 
83251 TAGCCTGGAC TTTGACAAGG ACATCTGGAG ATAAGAAAGA TTCTGAATTA 
83301 TTTTTCCCTT TGGGCAATTG TAGCAATTTT AAAACTATGT TAGATGGCTA 
83351 GAGATTCTTG AGAATATTTC TTTTCTTGGA AAATCATAAG GCTTTGGATA 
83401 GTGGTACCTA TAGAAGCTGA CATCAGCAGC AGCCTGCCTC CAGTCGATCA 
83451 GGGCCTTTGG AACTTCACGG GGCTCCTCTA CTGACAGCCC CATCGGTTTC 
83501 CCTCCAGCAC ACGTAACTCA GCATTGACTC TGGGTAGTAG AGGGTGGTTT 
83551 ATGGAATCTG ATTCATCTCA GAAAGAGGTG GATGCAAACA CATTCCCAGA 
83601 GCAGAAGGCT TGGCATGTCT GGTCTTAGGC AGAGGGAACT GGAGATACTT 
83651 GTCCTATTGT TCTTGAGATT CCAGCAAAAA TAGCCCATTA CAGAGGAAGA 
83701 AGATATCAGG TCAAATGAAG GCTTTGGTGC TACAACATTG TCTTAGAAAA 
83751 AAAAAGAAAG AAATTGGCCA AGTGCAGTGG CTCAGCACTT TGGGAGGCTG 
83801 AGGGGGGCAG ACCACTTGAG ATCAGGAGTT CGAGACCAGC CTGGCCAACA 
83851 TGGCGAAACT CCGTCTCTAC CAAAAAGTAT TAAAAAATAG CCGAGTGTGG 
83901 TGGCGGGCTC CTGTAATCCC AGCTACTCGG GAGGCTGAGG CCGGAGAATC 
83951 ACTTGAACCT GGGAGGCGGA GGTTGCAGTG AGCCAAGATC GTGCCATTGC 
84001 ACTCCAGCCT GGGCAACAGA CTGAGACTCC ATCTCAAAAA AAAAAAAAAA 
84051 GAAAAAAGAA AAAGAAAAAA GAAAAGAAAG AAATTAAATT AAAAAAATTG 
84X01 TTTTTTAAAC AAAGGAAGGC TTTGGGCTTG GAGTCCAACT AAGCTAGGCT 
84151 GGAATCCCGG TTTCATCTCG CTTCTCTGTG CAACTTTGGA TTTTACTGAA 
84201 TCTCTCTTAT TCTCAATTCC CTCCTCTGTA AAATGAAGAT AATGCTAGTA 
84251 CCTGTCTCAT CAAGTTGAAG GAGACTTAAA TGAGATGTGT TGAAAGCATT 
84 301 TAGCATAGTA TGTGGCACAT AAAGAACACT CAATAAATGC TGGCTATAAA 
84351 GAAGCCAGAG AGAGACTCGG AGGTGATGAG AGAGGCCACA ATTCCCTCCA 
84 4 01 TTTCATTGAA AAGCAATTTT TATTATCTCA TTTGAAAGGC AGTATAGTAT 
844 51 AGTGGTTAAG GACATGCACT ATGGAGCTAG ACCTCCTCAG TTCACTTTCT 
84 501 GTCTCTATCA TTTATTAGCT GTGACTTAAC CTTCTTGTGC CTCAGTTTTT 
84 551 ATCATTTTTG AGAGAGGAGT AATAATAGTT CCTACTCTGG TGTGTTGTGG 
84 601 AGATTTGATG AGTTAATACA TATAAAGCAC ACATAGTAGT GCCTGGAGCA 
84 651 TATTAAATGA CATGTAAGTA TTAGCTGTTA TTTTATTAAA CAACATGTGG 
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84701 CATAGGACAT ATTGGAACTT TGAAGTCTTT GAGGCTCTTC CCAGTTTCAT 
84751 AAATCAGAGA CTACAGTATA AATATCTGCT TACATGTCTG CTTTCCCCAT 
84 801 TGGACTGCGA AATCTTGAAA CTGTTTTATT CATCTCTGCA TAGCGTTGGC 
84851 ATCGTATTAT GATACCTGAC ATTTACCAGG TGCCAAATGG GACTGGGCAT 
84 901 GTTGTAGGGA TTCAGTCAAT GTGGGTCATT GCAGGCGGGG AGGTGGGTCG 
84 951 GGTTAAAGGT AAGAGAAGGG CCTTGGGGCA TCACATTAAG TAGTTACCAG 
85001 ATTGAACTGC AAACATTGCT ATCCAGGAGA AAT.C AGGT C A ATATTTCACC 
85051 TTCATGGCAA TACCAGTACA GTCCAAGGAG AATGCATAGA AGGAAAGAAA 
85101 TCATAATCTG ATTGTATGTG TTTTTTTAGT AGTAAATAAT AATAATTATT 
85151 ACTATTCCTA TACAATTTTG TGTGTTGGTG TGTTTTGTTT TGTTGTGCAT 
85201 GAAAAATGGG GTGCTAATCT ATTCCCCTTC CCAACACCAG TGCTCAGAAG 
85251 AAATTTCCAC AGATAGAGAA GCTATAGGTT ATGAATTTGG CCTTGATGGA 
85301 TTCTGGGTCA CTATTTCTCA ATGTTTGTCC ATGTCATGTG AAGCTCTTAA 
85351 GATAAAGAAC AATGTCTTAC TCGTCTTTTT AACTTCTTTA CCCCCTAATG 
85401 CCTATCACAT ACTTTGCCCA TGGAAACTCA ATAGACATTT GTAAATGGAA 
85451 TTTAATTTCT GAGGTCCAGT AAAGCCTTTT TCCATCCTTC CCCTACTACA 
85501 CAGTTTGTCT AACCATGTCT TCCCTTCCAT CATCCACCTT ATAAACGTTA 
85551 TTACTCATTC TTCCATCACA TTCTTGACAC CTCCCATGTC CAATGTCAAA 
85601 CAAGTACCAT TTGGGAAACA GAATTCTAGG AATCTGGAGA CCTAGAGCTC 
85651 TTCAGACCCT GAAATCCAGT 7TTCTGAGCT GAGACAGTTT CTTAATTTCT 
85701 CACTCCAACT CCCTTTC7CC TCTTTCTCAA TGGATATTTT CCAAGTCTCC 
85751 ATTAGGCATA TAGCAA77CC AGAAAACATT CAATTTTCCC TTCTCTTAAT 
85801 GCCATGCTCC AAAACACCAC A77CCCTCTA GACATTGAGC ATTGGAGAGA 
85851 GATGGAAAAG TACTTTGAAA AT CTGTGC AT GTGAGAAAAA TGCTAAGTGT 

85901 TCTGTCTGGT CACTTCAATG A-J AACTTTGC TACTTTAGAA ACTTGACTAA .y 

85951 ACAGAGTGTG A."a^AAAACA 7 ~\AAAGAAA AAAATGTGTT CAGCTTGGCT 

86001 GAATAATGAC CAC,7A*GGGT . AAAAGATAAG ATAACCACCC GCTCACAGGA 

86051 TTTCTATCCT CAAOCCrCTA^ A>\ G* ~.TT> \A Z A ACAGCAGACA CTGAAACTAC 

86101 TCTTAATGGA G^K.TC.'TA AA.VAAGCAAC ATTATAGCCG CTTTTAGGAA 

86151 AGCAAATAGG AAACTTO'". T^ i AAA 7 A ~, AGAA GATGCCTAAG CATGTGAGAT 

86201 ACCACCTCCA TCTTCOAAAA TAAZCAACGT GATACAATGT TATGCAGGAC 

86251 CCCTTAATTA AAA J A; "»A 777 A"J7CATTAAT ATCAGGAGCA TTGTCAAGAA 

86301 TCACAACAAC AG7AA77AG7 TA.'TATTGAG CAATTTCTGC TAAGTAATTT 

86351 GCAGGAGGGC A7CTCACTTA ATT ATGACAT CCTTTTATAG ATGAGAATAT 

86401 AGAGGCTTAA AAA GG TO TTT TT/CCAATCT TATTCAGCTA TAAGTGGTCA 

86451 GTCATGACTC AAA Z AT AGGT CAA77TGACA ACAAGATCTT CACTCTTAAC 

86501 TTCTCTTCTG TGTTCTAATA CCrTTGATCC ATGGAAATGG ACCATCTTCA 

8 6551 TATACTGCTT TTTTGC CTCT GCAATGTCCA GGTATGGATT GGGTAATGCT 

86601 CAAAGACAGA GAGCAATAGA GTATTAAAAA GATCCCTGGC CTCATTTTCT 

8 6651 GAAGACATGA GCCTAAGCTC AGC7G7ACCA TTTACCATCT ATGTGAACTT 

8 6701 GGGCAGATTT TTTGAC A77G C7GGGTCTCA ATTCCTGTAA CTGTCAAGTG 

86751 GAAGTGAGCC TAACTGCATA GATTTCACTG GGCTGTTAAG AGAATAAAAT 

86801 GAAATAACTG T AAAC AGAA ?* TGZCTAGTGC ACATGCAAAG GATTATTGGG 

8 6851 GCTTTCTACC CTTCAGGGAT 7AGAAGTTGA TAGTAGGCAA CAAGTTATAA 

86901 GAAATACAGT CAATTCTCTG CTGACCAGGG CTAGAGTTAA TTGTCTCTGG 

8 6951 AAAAAAGGAC TTGCCTCTCT 7TCTCTTCTT CCTCCAAAAC TTAAGACGTT 

87001 TGCAGCTGAA TCCCCAACAG GATTTTGTTT TCCTTTGGGA GAGAGGAAAC 

87051 AGACCAATAT ACCCCCAAAA CTAACCCCAT AATTTCATTT C AGC AGT AAA 

87101 GTGAGGTCCT TGATAACTGC CCTGCCCAAC CTGCAGGGTG GTTGGGAAAC 

87151 TCTGAATGGT CATGCATGGG GAAGCATTGT GTCCACTGTA AAGAGCTCTC 

87201 CGGAGATGAT AAATCTCATC AGAAGGCTTC ATGCTTGAGG CATGGATTCT 

87251 TGGAAAAACA ATCACTCTAC GTATGTGGTC AGAATCTAAA GGAGATGCTG 

87301 GGGAGAGGAG CTAGGTCAGT CTCCAAAGTG GAACAGTAGA AACTAATCAT 

87351 GTGGAGCCTA AACTTATGAA GGTTTTTAAA ATCAGAATTG GCCACCTTCC 

87401 TTTGGACCAT GAGCTCAGAT TGTGAGGTGT GACTAGGTCA CGTCTCCTTC 

87451 CTGCCCCTGT TTCCCTCCTC TCCCTACCTG TCCCTCCTTG ACCCCAGGAA 

87501 AAATTGCCGG GATATGAAAG TTAATTATGA CCCAAGGGAA TTGGTACAGA 

87551 TGGGGAAGAA AGAAATGCAT TCAAGAGCAT TTCCATCAGT ATTGAAATTA 

87 601 CACAGAAGGC TGGTGAATTT GGGCTATCCA TTCTTGCCTC CCTCTGTGCC 

87651 CATAATTCCT TGGCCTCCTT CAATTTCATT TTCCCTTTGG TTCAGAGGAA 

87701 TGCTTGATGG CTTAAGCTAG CCTCAGTTGG CCAAGCATTG GAGAAACAGA 

87751 GAGGTGTATG ACACAGCTAC ACTCCCATGG GGCTTACAGG GCAAGGTGAG 

87801 AGAAGACAGA AGTTGTATGT GCTGGGTGCC ACGTGGTAGC TACAAACTAG 

87851 AAATGAGACC AGGTTCGGAA GAGGAAGAGG GCTTGCAGAC CTGAGTCATG 

87901 GGGACAGTTT CTTCAGGAAA TGGGATCTCA GCTCTGCCTT GTATGCAGGG 

87951 CTTACATAAT AAATATGTTT CATTGTTGTT GTTGTTATTG TTGATTTAAT 

88001 AAGATTTTGT TTTAAGAAGA TTTTGTAAAA ACAACTGAAC AAATGCAATC 

88051 TCCTGCCAGA GCAGGCAGCA GCAAAGGAGA TTAGGAATAT AACCCCCTTG 

88101 GAGACGTTCC TTCACCTACC TGGTGCTGGA TTACCTAAAA GCTTCAGCTA 

88151 AGTAGGGTCA CCCCCCCAAG AAATTATTTT AAAAAAATTG AAATCTGATA 

88201 TT TTT AG AAA ATCTTATCAA GGATATTTAA TTGGACTATT TACACCTATT 

88251 TAGGGTCAGT CGGTTTTGGA CAAGTATGCA GGGGTCTTGG AATCAGACCA 

88301 CTGGGGTCAA ATCCTAGTTC TGTCACTTCC TAGCTGGGTG ACCTTGGACA 

88351 AAGTTACCTG ACTTCTAATA GCTTCAGATT CCTCATGGGC AAAATAGAAA 

88401 TGCTACTAGT ACTTAATAGT GCTCTGAGAA GGATTCAATG AGAAGGATTA 

88451 AATGTATGTA AAGCACAGTG TTTGCCCATA GGAAGCTGTT ATTTATAAGG 

88501 GAGGGGAGCA TCCTAAGGTC CTCCGAATTT AGGAGAACTA AAAATCTTAC 
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88551 ACTGACTTCT CCCTTCAACA GCACCTTCAG AATCTCCTTC ATTTTTCATA 
88601 CTGTTCTTTC AACCCTTTGA TGAATGAGAA ATTAGGCATT CTTTCCCTGC 
88651 AGATTTTCCC AAACCTTCTG ' CTTTGGCCAA TAAACATATT TTTAGTCCCA 
88701 ATCTTGCATG CTCCTTTGGG ACTTTTCATC TGATAAACAT CCCCTCCTGT 
88751 GCTCTTGAAT CCAATACCCT TCTTCCCTGC CCTCCACCCA GAGTCTCCTT 
88801 GTATCTGCTG TTAGGCACAA TGATGACCCC ACCAAGGTCA GACAATGGCT 
88851 GTGGCCTCAC CTGGACCTTG ATGACCCACA TAGCCTAGAG CCCAGAGATC 
88901 AGCCACTGAT GGAGGCCCAG AGGGCAGTTG GAAAACTTCA CAAGACAATC 
88951 CAGCCTGATT GTTTTGACAT GCCTGACTTC AGGCTGCTAA AAATGAGCTC 
89001 GAGGAATCAG ATAGGAAAAA GAGATAGGTG ATGCAATTTT ATTCCATCTC 
89051 CCAATTTTCT GAGTCAAGAG TTGTTTGTTT AACTCCAGTT AAATTAGTAT 
89101 TTATCCAAAT TTCCTGGGTG CTTGTCCAAA GAAAAGTACC CCAGATCTAC 
89151 AAATTAGAAT CTGGGACTGG GACTTAGGAA TTGGCACTTT TACAATTATA 
89201 CCAGATGTTT CTAATATGAG TACTTCAACC ACTACCCTTA TAGAAGTGCT 
89251 GCCTAGGACC CTCTCTTCTG GCAGGTGAAG TGGAAGGAGG TTTTGTCGAA 
89301 GGGAGATTCT CCACTTCAAC TTGAGTGTCT TGGCTTGTAT CCGCTTTGTT 
89351 TGGTTCTATT TCACCAAAGG CTTTCATCTT CACATAAATT TTCTTCAGCT 
89401 TTAAATAATT AGTTTTGGTA ACCATTGGTA TACTGGAAAG AACATTAGAT 
894 51 TTGGAGTCCA GGTGGCTTGA GTTCAATTCT CTGCTCTGCC ATTTACCAGC 
89501 TGTGTGACAT TGGGCAAGTT GCCAACCTAT CTATGTCATT TCCTCATGTA 
8 9551 AAGATAATCC CACTTCACCA GGCCACTTTT GAGGACCCAG TGAAATGATG 
89601 TGTAACCATT TTAGGAACAC TGGATCATTC TACAGTGCAA TTTTTTACAT 
89651 CAGCTTGGAG CCTACCATGT AGGCATTCAA ATCCACTGAG TGTATGGAGC 
89701 TCCGTGCACA AATAAAAGGA CTTCTCTTTT CTGCCCGTGT ACAACTTTGG 
89751 TTTCCTTAAT CAATAGAATC CATGACAATC CTGGGCCATG GTATAAAGAT 
89801 GGGACTTTCT TCCTGTGAAG GAGTCTGGTC TGAACATCTT CCAAACTCCA 
89851 ACATAACTGA TGTCATTTCT CCACCCAACC CCATTTGCTG TCTCCTGACT 
89901 CAATTGCTAG AGAAGCCACT TAAGGAAGGT TCCTGGAGTT AAGGCTGTGT 
89951 CTGGGCCAGT GTAGCGAGCA GTTTTCAACA GTCAGTCCTC TTTATCTTCT 
90001 CTTTTCCTGC GAGCCTTTAC TAAGCACTGC CTCCTCCTGT CTCCTTACTG 
90051 CATCTCCTGA TGGAATGCAC AGGTAAATCT CCTTGGAGAG TACCAGCCAG 
90101 GAACAGTCCA CAGCCAAGGC CACCGATCCT CACCGCTGAG CTCCATCTTT 
90151 CCTTTCAAGC TGTCCTTCCC CTCCCCTCCC CACCATCACC ATAGCAACAC 
90201 AGTGGTATAA AAAAATGAAA GCGCTAAGGC ATCTAAATAT AGTCTGAGTA 
90251 TCAACTCTTC CAGCATGGAG CCGAAAACCT AGGGAATGAC AGCTAGAGGC 
90301 ATCCAGACGA TAACTGGCAG CCAGGAGGGT GGATAAGTCA AAGGAAGGGG 
90351 TCAAGGAAAG AGGGGAAGGA AAGGGAACCA TCACTTGCTG AGCCTGCTGC 
90401 CTGTGCTTTC TCATGTCACC CGCACGACAA CCCAATGTGA ATGTTATCAT 
90451 CTCCAGGTAA CTGCTGAAGA AACGGAAGCT CAAAGAGGTA AGAGATTTGG 
90501 CCAAGGTCAC ACAGCTATAA GCAGTAGAAC TAAGATTTTA ACTCAAGTTT 
90551 CTATGGCCCC AGAATTTATG TGTTTCTCTC TCCATACCAC AGGGACAGGT 
90601 GCAAGTGAGA GATTTTGCTG GAAGCACTGG GCTTTTTGAG CAGGCCATAT 
90651 AAAAATTCTG AGCCCAGAGC TCAACTAAAT TATTGGAAGA GACTGGGCCA 
90701 AATATAAGGC TTCTATCTAA GCAGCACCTG TGTTTCTCAA GGACTGAGGA 
90751 AAATGAAGGG GGAGGGTTGG CAAGGCTGCA TTTCCCAGGG TGCGTGATTA 
90801 TATGGCATGG GGGTGGGGGC CATTATGATG CCCGGACATG GAACTTACAC 
90851 CAGTGCAGAA AGGGTGTGAT TAGAAGCCCT AAGCCAGAGA ATGTTCAGTG 
90901 TGATAAATGC CATTATTTTT TCCCTCATTC ATTCAATAGA TTTTTTTTTT 
90951 AGATGGAGTC TCACTCTGTC GCCCAGGCTG GAGTGCAGTG GCACCATCTC 
91001 AGCTCACGGT AACCTCTGCC TCCTGGGTTC AAGCAATTCT TGTGGTCCAG 
91051 CTTCCTGAGT AGCTGGGATT ACAGATGTGC ACCACCACGC CTGGCTGATT 
91101 TTTTTTTTTT TTTTTTTTTT TGTATTTTTT AGTAGAGACA GGGTTTCACC 
91151 ATGTTGGCCA GGCTGGTCTC GAACTCCTGA CCCCAAGTGA TCCACCCACC 
91201 TCCACATCCC AAAGTGCTGG GGTTACAGGT GTGAGCTACC GTGCCTAGCC 
91251 TCATTCAACA GATATTTTTA TTAAGCATCT GATGTGTGCT TAACTCTGGA 
91301 AATATAGGGG TGATTAGAAC AAATGCAGCT CCTGCCCTTG TAGAGCTTAT 
91351 TAGGATAGTG GAGAAGACAA ATAAGGAAAC AATTATACAA TTGATTGATT 
91401 CTTTACAACT GTAACATGTA CTATAAGTAC ATAACAGAAG AATATCACTT 
91451 GCCTGATGAC TTCAGTGAAA GGGAAATACA GAAGTTCTTA CAAATCAAAG 
91501 CAATCCCCTG GGCCAATTGT AAAGGTGATG CCCACTTTCA AGGTGGACAG 
91551 AGACTGTGCT AGAAGCTTAG CCTCAACCAT GGGTTTATAT GATTGGTAGA 
91601 CCCTGCAGAT CCATTCCCAA TGGTGTATCT TCATACTAAT CATGAAATCC 
91651 ATCTAATAGC CATACAAGTG AGGTTTTAAA AC CC AAC AAA CTAGACTCAA 
91701 ATGAAATCTG ATGAGGGAAT TTATGATTTG TTCTTCCTAC AGCCTTTGGT 
91751 ATCACTGACA TAAAACTGAA TGTATGTGCT GAGGGTGCTT GTGTCTTGGT 
91801 GATAGACAAG GTAGGTGGTC CAGCCCATGG TACTGGCAGC TTAAAGTCAG 
91851 CCAGCCATCA GTGGGAAGTG CCTGTGAATT ATGCAGGAGT GGGAGGGGAG 
91901 GGAGTAGGCA GTAAAGTAAT GCATTTCTGT GGATCCAAAG CTTTCCAAAC 
91951 TACCTGCAAG TCAGCAAATA TGGGGGATGT TGTATGACTA AGTGAGAATC 
92001 AGATAATATA ATGTGTATGG AGCTCTTTAG TTCTTCAGAA AAAAATGCTG 
92051 TCTAAACAAA TAGTGCTGAT ATCAAAGATA ATGATACAGT ACCCTAATTT 
92101 TAATGCTCTG CTACCTACCT GCCAGCTGTT TCCCAGGGAT GTGGTAAAGA 
92151 TGAATGGGCA AGATCTGGGA AAGTGTTTTG AAATCCTTGA TTAAAGGCCC 
92201 TCCAGGCAGA TGTAGAATTT TAAATGTGTT ATATTACTGC CACTATTGTT 
92251 ATGCTTTCTT TTATCACCCC AGAATTTCAC CATCTCCTGT TTCAGGTGAA 
92301 CGAGTCTGCC TGACTCTTAC CTGCCCTGAA TGGCATTGGA AAGGTAGCAG 
92351 CCCTGAGATG TGCCATATAA ACAAACATGT TTTTAACCAA GGGATCAGGA 
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92401 GGCCTTCCTG GCTGGCTCCT GTCAGCTGGT CATCACCTCT CTATAACTCT 
924 51 AGGCTTTCCC AAGCTTATTT TATTTCCATC AATAGGACAG GAATATGTAA 
92501 ATGTCCTGCT TGAAATGAGT ATTGGCTACA AGCCATCTGC CTCTGAACAG 
92551 AGGTGAAAAG TGGAAATCGG AGGAAGGGCA GATGTCTTTT GCAAGGGAAA 
92601 CAGACTGTTT TCTGCCACTG CACTCTGCCC AGGCAAAAGA GTAAAGGAAC 
92651 AGCACTCAGG AGAATTCACT GAAGCGAGGG CAGGGTGCAA AAGGAACTTG 
92701 AGAAATTGGT ACTGGGACCC AAAAT C AG AT TCTGGCATTT CTGGGAAAAG 
92751 AAATGGGCAT GGGTGGGGGT TTTATCTGTC AATAAAAGCA TCCAGAATGG 
92801 GGCTAGAAGG AAGTAAATTC AGTTGCCACC TCTGCCTACT GGACAGCCAC 
92851 jGAGAACTTC TCCTTATCCA AGGTCGAGGA GCCCTCCGGA GTACATACTG 
92901 ATACCATTGG TTCTCCCACA CATACCCCCA TGGAGATAAA AACAGGACCC 
92951 TGGAAGCCCT GTCCGTGTTT AACCAATGGG ATTGAAACAT GGAAATGAAC 
93001 TGCCCCACAA TCCACCCTGT GAGAGACCAA AGAGCAGTGT TGGATTAACA 
9 3051 GGGAATGTTA CCCTGAAAAG GCATTCAGCT TCCACTGGGG CAGCAGGTAC 
93101 AGTGCAAAGA TGATCCCACT TAAATTCCTA AGACAGGAAA TAAGGAAAGA 
9? 151 TGTTGTGGAA ACTCAAGACC TCTCAAAGCA TACTCCTTTG TAGTTCTTCC 
93TC2 GCAGACCAGA CCACGGAATT CAGAAAACAC CCTACCTGGT TCCAAACCAG 
932 51 CACCTGCCAA ACTTCTCACC CTCTTCTGAC CCTGTCCTGG GAGTTAAGAA 
9 3 3C1 AAAAAAAATC ACTTTATTGG TTGCTCCAGT TATAACTTAA ACAGACAGAC 
9 3 3bl CATC ATC AAA TTAAGTGACA TGTACGACTG CTTATTGTAT GCCAGTTACT 

93 4C1 GTGCTGTGGG GTTTTGGTTC CATTATCTCA TTTAATCCTC TCAAAAACCC 
934 51 TGTTAGGTAG GTTTTATTAT TGCACTCATC TTAGATTAAG GAAACTGAGG 
9350Z CTCATAGAGA TTCGGTAATT TGTCAAAAGC CCTAAAACAT AATTACTGCC 
93551 TCCAGATGTC TCTGATTCTA AGGCCCAGGC TCTTAATCAG TAAATGATCA 
93601 AATGAATAAT GATTTTCATG GCATCTGTCA TCGGAAAGAA CAATGGAGAA 
93651 TATCCTTAAC CAAAGTCATA ACCAAATAAA TGAACTTGAC AGCAGAGCCG 
93701 TGATTCTAGC CAAGATGACT ATTTTCATGC ATGTTTTGAA GGCCAGGAAA 
93751 AGCAGGTTAG ACTTGTTTGG GAAGGGAAAC AGGAGCTATC AAGGTGAACT 
93801 TTTCCTAAGA GTAGCCCAAT AATAGTGCTC GGGAGGGAGT AATGTGTGCA 
93851 AGAATAGAGT CAGGGAGACC AGCCAAGTGT GTGCCTCAGC ATCCCTAGCA 
93901 CAAATCACAC ACTAAGCATT AAGATTGTCT CTGCAGTGAG AAAGGCCTGG 
93951 GACCAAATTT GGGCTCCACC ACTTACTGGT ATTCATTAAT CATTCATGCA 

94 001 TTCATTCAAC AAATATATAT TGCGTGTGGT CTATGTGCCA GAGACTGTGC 
94 051 TGGGTGCTGG CAAAGAACAC AGACAAGGTT CCTGCTCTCA TGGAGCTTTT 
94101 ATTCTGATGA AGGAAACAGA CCACTTACAG ATAAATAAAT AAACAAGATA 
94151 AAGGGAAACA GATATGATGG AGAGTAGCTG GAGGGCCAAG CAGACCGGGC 
94 201 AGACAAGGTG GTGGCATGTA AGCT AAG AC A TTTAAAAAGA ACCTGGTCAT 
94251 GAGACTATCT GGAGAAGGAA AGCTCCAGGC AGAGGAAGCA GGTAGTGCAG 
94 301 AGGCCCTGAG GCAGGAATGA GG AC AAG ATA T TT G AG A AAA CAGAACAAAG 
94 351 GCAGGCATGA CCAGGCCGAG TGGGTGGTGG AAAAGTAGTA GAAGGTGAGT 
94 401 GGGGGAGTGG GGGCATCAAG GTCAGGCTTT GCAGGCTTGA TCAGCGTTCT 
94 4 51 CACTGTGGTT CTGGAGCCAG CAGCATCAAT GTTACCTGGG AACTTGTTAG 
94 501 GAATGCAAAT TCTCAGGCCC CACCCAGACC TGCTGAGTCA CAAACTCTGG 
94551 GATGGGGCAC CTCATTGTGT TTTATCGAGC CCTCCAGATG ATTCCGAGTA 
94 601 TGCTAAAGTT TCAGAATTCC TAGGTTGGAT TATGCAGTTC AATTTTAATT 
94 651 TTAAATGCAA TGGGAACCTA TGAAAGATTT AAGT AGGGG A GCAGCATGTT 
94701 ATAATTTTCT TTAAAAAATT GTTTTTAAGC ACTCCTGCTG AGGAGAGAAT 
94751 GGACCATAAC AGGCTAAGAG AAATGGAAGC AGGGAGATAA ATTAGGTGGT 
94801 TATTGCAAGA GGCCAGGTAA G AAG AG AAAG TGGTTTAAGT AGGGTGGTGT 
94851 GGCAGAGAAG ACGGTTCCAA GCAGAGGGGG ACCACGCTGA CAAATAAGCG 
94901 CGGGCCACTC ACGCAAGCCC AACAAGGCAG AAGGCAGAAG GCAAAAGTGA 
94 951 AGGCCAGAGA AAACTGGACA CCACCTTTCC AGAGCACAGT TCAAAGGCAA 
95001 TGTCCTCAAA GAAGACACTC CACCCTCCTC CCATTTCCTC CCTATTGCCT 
95051 AAAAATAAGA AGGATACGCG GCCTATGGCA AACCTTGGGC AGGCACGTGG 
95101 GAGCTGAGCT CTTGCAAAGG GCAGATAGTT CCTCTGGTGA GAGAGAAAAG 
95151 GAAGGGCCAG TGAGGAGTGA AGGAAGAGAC GAACAGAGAG CCCGAAAGGC 
95201 TGAGAACGTT GTCTGGCTTC CTGAAAGGCT TAAGGGGTTA GCTCTGGAGG 
95251 GTGAACTAAA AGCCCTAGTT ATATTAAACA CACACGCACA CACGCACGCA 
95301 CACACATGCG CGCACACACA CACACACATA CACACAGTTG AAGGAGACCT 
95351 GCAGTTTCCA AAAAC AAG AG TTGTATTTTT TTTGTTCATA TCATGACCCA 
95401 TAACAATCTC A AAAG AG AAA CAATCTCTTG TCTTCCTTGT TTAGGCTTAG 
95451 GAGAACCTGT AGT AAGT AAG CAGCAGCAGC GGAACTCAAA CTCGACTCTT 
95501 CCTACTGTCA TTCTCTCTAT TACACCACAA GGCATCAGAG GACCACTAGA 
95551 GTCGCCTCCC TAGGGTTAGG GTTAGGGCAA GGTAAATGAA GTGAGTCAGC 
95601 AAGGGCAGGA TAGGAACCTG TCTTTATTAA CATTTTGATA TTTTGTTTAT 
95651 CATGGATTTG TTGCATTAAT TGCAACTTTT AAAAATCATT GCATTAAAAT 
95701 AT TAT TG ATC TTGATTACTG AGTTTTTAGG TGTACCCTTA AATGT TGC AC 
95751 CTCTGACTTA CTAGTCTCAC CCTGATCCCT GTCCTGGATC TATGCCTGTC 
95801 TGTTCTATAT CAGCCTCTTG CTTTGACCAT AAGAATAACT TCAGACCTTT 
95851 AAGCATAGAG GAAATAGGAT TTCTGTCTCC CTTCCCCACC TTTGTGATAA 
95901 TCTCAGCTTC TGCTTTTAAA GTCTATCTCC CAAGTAGTTT GCCTACTATG 
95951 TTCCTCCCAA GGTCACTAGG TTCTGTGAAA CTAGCAGCAG GCTAGATTGT 
96001 CACATTAGCA CAAAGGATCC ACTATTCCTG CAGCCGAGCT GGGACAAGCA 
96051 CTTAGGCCCA CTGACTCCAA CCCTTCAATA GCCTGGGACC TACGTTGTCT 
96101 CCAGGTGGTA TAAAACAAGA ATTTCCCCTT TGACTGGGAG AAAAAGGGAA 
96151 GAACTCTAAA TTGGAAAACA GGTCATCTCG AATTCTCACA GGTGGAAATT 
96201 TCTGACAACC CCTTTGGGAC CCACAATTCA ACACACCCCA AATGGGG AC A 
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96251 GTAGCTAACA TGCAACCTGT AGGCTGTTCT GTCATCCAGT GCCACTGTGC 
96301 TGCACACCAC CAGGGGGCAG CATTCTCATT GGCTTCTATG TGCCTGGAGC 
96351 CCAGTGCAGT TGTGCAACAC TGCAGCTTTG CTTTAGTGTA GTCCCTGATG 
96401 GGTTCAGTCA AGAAAATGTC TATAGAATCA GCTAATCTCC CATGCAGTTA 
96451 AGTCTCTAAT TGAAATATTT TCTCTGCTCA GCCCAGGGAC AGCAATCTTT 
96501 CCTGGATTTG CTATTTACAA GGATCTCTAG AAATTATCCA CCAGAAATAT 
96551 GGGCTTTCTC AGAGCTTGAG TGGACAGGGA ATTAAGGTGG AAGGCAGGGC 
96601 GTTTTGACTG CATTTGACCC AAGTCCTGAA GAGCCAGCTC CTCTCTCTTC 
96651 CTAATTATTA GAAGGTTTTG TTTGGACCCA GTGTTTCACG TGTATACAAT 
96701 ACAAACTTCT CTCTTTTCTA CTTGGATCAA ATTTGTTCTC TCAAAATAAG 
96751 ATTCCCAGCA GTGAGAGAAG ACAAGACAGA GAGATCCAAC ATCTCTAAAG 
96801 CCATGAATCA GATAACCAGC CACTTGTTCT CTTCAGTGCT GGGAACAGAT 
96851 ACACTGTTAA ATAAAATGAT TTTATAGATT CTTCTCACTG CCTTTCCAAG 
96901 AAGGGG AT TT ATCAACTTCA GGGCACAGCA ATCATTTATT CCCAGACTAC 
96951 TGGCATGCAT ATATATATAT ATTTACTTCT CTTGACTTAG AAAAAAGAGA 
97001 GAATTGGAGT TGTGAATATT CCTGTCTCCC TCACCCCAGC CCCCTTGAAG 
97051 TGAGTCAGGA CAAACTTGGG GCCCAAATGG AGCTGTAAGT AACTGAGTCA 
97101 CATGCAGAGA TGAAACCTTC ACAGACCCAC T GAT AT GG AG GTTGAAGATT 
97151 AAATTCCCCT TTGAGAATAA CTGGGTAACA CTCATACAGA GACTACTTTC 
97201 AAGAAGGCCA GATCCTCCCT CTAATGTATA GTGCAACGTT CCTAACCCTC 
97251 AGCCCACTCC GTCATACCCC CACTCACATG AATACACACA TAAGCAGTAA 
97301 TATAAAGCAC TTCCCACCAT AGGGCAGCAA AGAAGGAGGG AAATCTTTAT 
97351 TATGGAAGAG TGGAAGGAAG GAAGGGAAGG GAAGGGAAGG GAAGGGTAAG 
97401 AGGAAGAATT CTCAGGGTGA GCAGAGGAAT GACATGTTTG GGGCATAATG 
97451 AAGATAATTG AAGTGCAGAG TTTGTATGGA AAAATTTGAA AATATCAGGT 
97501 GGCAGGCCAG GCATGGTAGC TCATGCCTGT AATCCCAGCA CTTTGGGAGG 
97551 CCAAAGCAGG CGGATCACCT GAGGTCACGA GTTTGAGACT AGCCGGGCCA 
97 601 ACATGGCAAA ACCCCATCTC GACTAAAAAT ACAAAAATTA GCTGGGTTTA 
97651 GTGGCGCATG CCTGTAATCC CAGCTACTCG GGAGGCTGAG GCAGGAGAAT 
97701 CATTTGAGCC TGGGAGGCAA AGGTTGCAGT GAGTCGAGAT CATGCTACTA 
97751 CACTTCAGCC TGGGTG AG AG AGCTTTCTTT TTTTTCTCTC ACAAAAAAAG 
97801 AAAAGTTCAG GTTGCAGAGA TGGATGGATG GATGGATGGA TGGATGGATG 
978 51 GACGGATAGA TAGACATTAC AGAGAGTTTC CAATTCTTAG GATGAATTGG 
97901 AATCCTTAAG TCTTTATTCT GTAAGAAAGG AAGGGG AGAA TAAAATTTTG 
97951 T G ATTTT AAA ATATTTTCTA CCCTGTAGAG CTACCCTACA AGGCATGAAA 
98001 ACCTTAAAAA AAAAGGCATC TACTTTAAAA GAATAATGTC TAAAAAATTA 
98051 GAAATTCCCT CTTTTTGCCC TGACCTTTGG G AAAC AG AGT GAGTGATCCT 
98101 TTTGAGGTTT TTGGCACTGC CTTGCCTGTG ATCATATCCT GAACCCTAGG 
98151 TCCATAATCA TGCAGTTACC TCAGATGTCC CTTTCCCTCT AGCCACAGGT 
98201 AACACGCTCT CCAGGCACTG GGAAAGTGGG TAATTAGGAA AGCAGAGGAG 
98251 TACCCATGGG CTGTGATGCC C AGT TAT AAA CCCAGACATT TCAGAATTAA 
98301 CAGAATGAGC ATCAAGTCCT CAAATGGGTC TACATCCATA AACATGTCCA 
98351 GCAGTCAGCT CTTTACTGTC AGTAGAGACA AAATGTTCCT ACACTTTCCC 
98401 TAGGGGAAGC CACATCCTCA GTAGGTTATC TCTGATGAGT CCAGCTAGTC 
98451 ACAGGTATGT AGAAGCTGCA TGCAGCAGAG GGCTCAAAGG AGGGTCCAGA 
98501 ATAGATACCA AAGCAAAAGG GGAGTCTGTG CACGTTCTCA CACGCACCCC 
98551 GAAACACTCT TTTTGTTCAC AAAATAGATG GTGTAGGGTA GTTCCAAGAG 
98601 ATCATTTAGC TCAGGTTCCT GCCTCCATAA AATAAATAAG CCTTCCATAT 
98651 TAGTTGTCTG TTGCTGTGTA GCAAATTGTC AG AAAC GT AG AGGCTTAAAG 
98701 CAATACCCAT TTATTATCTC GCAAGTTCTG TATCTCAGAA GTCCAGGCAG 
98751 GCTTGACTGG GTTCTCTGTC CAAGTTCTCG TGAGACTGAA ATCAAGGTGT 
98801 TGGCCAGGCT GGGATCTTAT CTGGAGGCTC TGAGGACATA TACGCTTCCA 
98851 ACCTTATTCA GGCCATCAGC AGAATCCCGT CTCTTGTGGC TTGAGGTTGG 
98901 AGGTCCCCGT TTCCTTGCTG GCTGTCATCC AGGGACCACT CTTTGCACCT 
98951 ACAGGCTGCC TATGTTCCTA TT C AC AAG AC ACCGTTCATC TTCAAACCAA 
99001 AGCAGCATGT AGAATCTTTC TTGTGGCTCG TATCTTTCTG GCTTTCCCTT 
99051 CTTCTTTAGC CAGAGAAAGT TCTTTGCTTT TAAGCGTTCA TGCGATTCAA 
99101 TCAGGCCCAC CTGGATAATG TCCCTATTTT AAAGGTAACT GTGATACCGT 
99151 ATAACATTTC AGGAGTGATA ACAGCACATT TACAGGTTCC AAGGATTGGG 
99201 GCAGAACATC TTTGGGGGAA CATTTTAGAA ACTCTGCCTC CCCACTCACC 
99251 CATAATCCTT TTAAAAACCA AATCTTGAAG CCTTTTTTTC CCAAAGGCCT 
99301 TTTTGAATAA GCACATTTAT ACCTAACTTC ATCAGACACC CACTTTGAGC 
99351 AAACACTAGC ATGTGGCAAA ATAGGCTGTA AATCAATCAG AACTATTCTT 
99401 TCCCACCACA ATCTTTCTCA AACACATTGG GAGAATCTGA CACTGTCAGT 
99451 GGTATACCAG AGCAGACTCC TACCATCTCA CAAGAGCTGA CTGTTAAATG 
99501 TTTAGTAATT GTGGACATTG GTTGTTAAAC TATTAGTAGC CTGAAATTGA 
99551 CTATAGTGAG AGTATTTTCA CCATGGAAAG CAACCGTTCC AAATCAGGGT 
99601 TTCTCTTTAT TCCTGGGAAG CTGGTTTATT AGCTCACCAC TGGCTGTAGT 
99651 CCTTTAGGGG TCATTACTTG ACCTCCTGTA GCATGCAGGA ATCCTCTCCA 
99701 TGGCCTTTTT TATGCATGGA CATCATCCTA TTTTTTAATA CCAGGAATGG 
99751 GGTGATCACT CTCTTATAAG CTAGTTCATC TCCCTGATGG AATGGTATGT 
99801 GGTAGAGTTG AAACCCACCT CCCTGGAACT TCCCACCAAC TTCCTTTGGA 
99851 AGCAGCACTT GTGACAGCCC CAGAACCATT TGGAGTAAGT AGCATTTCCT 
99901 CCAGGAGACA TCTCTCCTCT GGATCCACAA ATCAATAGTT AGATGCAAAA 
99951 TCTTTAGAGC C AC AC TGTTT GAATTCAATT CCCAGCTCTG CCACTTATTT 
100001 AGTTATAACC TTAGGCAAGT CTCTTAACTT TTCTGGTCCT CTGGTTCTTC 
100051 ATGTGTGGGA ATGGGGATAA AAATAGCACC TACCTCATAG GTTATTATGA 
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100101 ATATTAAATG AGATAATGTG TGCAGAGAAA ATAGCACCTG GTCTGGCCTC 
100151 TACCTATCTA ACAGGTTAGT TGTGAGGATT AAATTACTTA ATATAAGCAA 
100201 AATGC TTAGA GCTCTGCCTA GCACAAAATA AGCACTATGT AACTATTGGT 
100251 AAGTTAATTT GAAATGTGGT TTCTAGATCT CTCATCATCC TAGTCACCCT 
100301 ACTCTGGATG TACTCCAAAG TCCCTCTCAA GATATAGTGT CAGAATTGAC 
100351 CTAATTAGTC CAGCATTTGA CTGAAACGCT AGACTTTGAC TCCAGCCCCC 
1004 01 CATCCTTGAC TGGCACTAGC ATTCAAGCCG CTTCTCCTCT TTGCCTGGGT 
1004 51 CTTTAATAGA GTCAGAGCGA CTTCTCCAGG GGATCTTTTG GCCATGGACC 
100501 AGTAGCATCC ACACACGCTG GGGCCTTGTT AAAAAGGCAG GCTCTCAGGC 
100551 CCCACCCCAG ATCTACTGAA TCAGAATCCA CACATTAACA AGATGCTTGG 
100601 GTGATTCATG TGCACATTAA AGTTTGAGAA GCACCGCTTT CAGGGACGAG 
100651 ATGACACACT TATTTTAAAG AGAACGCCAA TTAGAGACCC TAAGCCTTCT 
100701 CATGGAACAG GGGCCTTCCC CTCAGACCTT GGGAGAGGGG TCAGGGAAAT 
100751 ATCAGTGTTG GGTTGTTGGT GACAGGTGGC GGT GGGGGGT TCAGTCCACG 
100301 TTCAAAGAGC CAGAAACCTG GCAGGGGAAG AGATGGGGCA GTGACACCCA 
100851 ACCGGAAAAA TAAAGGAAAC TACAAGAAGA ACCCAGCTAA GAGATGTGAG 
100901 GCTTCTGAAA GCTCCCATGG AAAGGTTCGC AGCTCCTCCA CCTGCTCGGT 
100951 CCAGCTGCCC CAGGTCAAGG AAGCTCTGTG AGTGTTAGCT GACCCGGAGC 
101001 AGCAAGGATA CATTCAGAAG TGATGAAAGG GAACGCTTCT TGACAGGGTA 
101051 AAGAGTCATT CAGTAGGAAT GAGACAGGAA GAGGTCACAG AGTCAGAAGC 
101101 CCAGCCTGTA CTCAGAGATT ATTTCTGGCA TGGGAGGGCC GAAGGGTTAG 
101151 GAGGCCACCT ACTCACAATA CAATACAGAG GCAGATCCAC TTATTACCTG 
101201 CCTGTGCTGC TGGGATTTCA GTGTGGAAAT TCTGTGCCTC CTCACTGTGG 
101251 CTGCAGCTTG GGAATGACAT CCAGAGCTTA CCCACCTGCA TAAGAAATAA 
101301 GCTATAGGTG TAATAGGGGG ACATAGGCTA AAATCCTAGC TCAGCTGCTT 
101351 AATAGCTGTG CGACTGAGCA AGTTACTTAA CCTCTTTGAG CATCTGTTTT 
101401 CTCATCTTTA AAATGGAAGT AATCATAATT GACCAGGCCC AGTGGCTC AC 
101451 ACCTATAATC CCAGCACCTT GGAAGGCCGA GGCCAGTGGA TTGCTTGAGC 
101501 CCAAGAGTTT GAGACCAGCA TGGTGACACC TCGTCTCTAG AAAAAATACA 
101551 AAAATTAGCC AGGCATGGTG GCAGGTGCCT GTAGTCTTAG CTACTCGGTA 
101601 GGCTGAGGTG GGAAGATTAT ATGAGCCCGG GAGGTTGAGG CTGTGGTGAG 
101651 CCAGATTGTG CCACTGCAAT CTAGCCTGGA GACAGAGTGA GACTGTGTCT 
101701 CAAAAATAAA TAAATAAAAT AATAATATCT ATGTTAATAA AGCAGAAATA 
101751 AGAATGAAAT AAGAGGCCTG ACATGGTGAC TTATGCCTGT AATCCCAGCA 
101801 CTTTGGGAGG TCAAGGTGAG AGGATCACTT GAGCCCAGGA GTTCAAGATC 
101851 AGCCTGGGCA ACTTAGTGAG GTCCCATCTC TACCAATAAT AATTTTTTAA 
101901 AAATTAGCTG GGCATGGTGG CATGCACCCG TGGCCCCAGC TACTCAAGAG 
101951 GCTGAGGCAG GAGGACGGCC TGAGCACAGG AGTTGAGGCT GCAGTGAGTC 
102001 ATGATCACAC CACTGCACTC CGGCCTGGGT GACAGAGTGA GACCCTGTCT 
102051 CAATAAATAA ATAAGAAGAA TGAAACAAGA AAGTTCTTCT TATGGTTCTC 
102101 ATGGTGGTGA GCACAATGTA AGC AT AT AT A TTATCTTAGA ATTCTTCCTT 
102151 CCTGTATAAA GAAGGCCTCC TCCAATGTAT TAATCATCTG TTCAACTAAT 
102201 AAATGCTGCT TACTCCCACT TTCACTCTAA AGGAACTCAA TGGCTAAAGA 
102251 GAACCCTTCC CCTTTGCAGC ACCCTGAGGA TCAGAGGCCT GATTTGAATG 
102301 TCCTCGATGC AAAGGACTAT TTCAAAAGGC CAGCCAGGCA GCCCAGACAT 
102351 GTATTTCCTA ATCGTCTCCA GGTTGTTTGA TAGAAGATCT CCTGGGAGCA 
102401 GGTTTCCGCA GCAGCTCAGC CAGGTCTGTT CTGGGAACGC TGTGTGCATT 
102451 GGCACCTCCC TTGGCAGAAA GCTTGGAGGA AAGGCAGGTG CAGGTCCTGG 
102501 AGCCTCTGAC AGCATTACTG GCTCTAGGAG TAGCTGCTCA GGATAATCTG 
102551 TCCCCATGAC CATTAAGTAA CTGCCACTGT GCGGGAAGAA GAACTGGAAA 
102601 TGGGGGGCCC AAAAAAATCT GAAAACCCTC ACTTGAACCA GTAAGTTATA 
102 651 CCCTGGGTTG CTGTTGGAGA GAGCTTCCTT GGAGTAGACA AATGTGGTAT 
102701 GTT AAGT AAA CTGGGGATCT AGGTTTGATG ATACTGGGTC TGCAGCTTCT 
102751 TTGTCCCACT GAAAATCCTC GGGCATTCCA TGAAAGTAGC CTTCAAAATA 
102801 TTTTTGTCTC TAATGACATA TTTTTGCTGC AAAAAGATGA GTGGATTCAT 
102851 TTTACGAAGT CTCAAGTGTG TTAGAAATTC ACCATGAGTC ACTCAGCAAG 
102901 TTATGTTTGA GGGCGTTCTG TATGCCAGGC ACTGTGCTGG GCACTGGGAC 
102951 TACTGTAGCA AGTCAGATAG ACAAGAACTT GCTTGATCTT GGAAGTAAGC 
103001 AGGGTGGGGT CTGGTTAGTC CTTGAATTGG AGACTGCCTG GAGATACTGG 
103051 ATGCTGCAAG CTTTTGAAAA AAGACAAGTT CTCTGTACTT GCAGAGCTTA 
103101 CATCCAGTAA CTAACTAACT AACTTCAGGC TGTGTTGAGT GACTGAAAGT 
103151 GGTGGAGCCA GGAGTCCTCT AGATAAGGTA GCCATGGAAG GCCTCTCCGA 
103201 AGAGGTGATA AGTTTACTCA GAGACGCAAA CGATCAGGAT AAGCACAGAC 
103251 CCCGGTGAAG AGCGTCCCAG GCAGAGGGGA TAGCAAGGGG ATTGCCCTTA 
103301 GGTGGGAAAG GGCTTGATTT GAGGACTGGG AAGACCAGTG TGTCTAGGAC 
103351 ACATAAGCAA GGGGAGGACG TTATGAACGA GGTCTGAGGG GTCAGCAGCG 
1034 01 ACTGGATCAT GCAAGCTCCC ATAGGCCATG GTAAGGGCTC TGTGTGTACT 
103451 ACAATTACAG GATGCATGAT AGGACCTGGG CTGCATTTTT AATAGTTAAC 
103501 CCTGGCTATA ATGTGGGGAA GGGATTGAAG AAAGAGGGCA AAGGCAGGAA 
103551 CAGGAAAATC TCTTAGGAGG CTACTGCAAA GCCCAAGGGA GAGGTGATGG 
103601 TGTTTTGTTG TTGTTGTTGT TTGTTTTGTT TTGCTTTGAG AAGGAGTCTC 
103651 ACTCTGTCGC CCAGGCTGGA GTGCAATGGC ACAATCTCGG CTCACTGCAA 
103701 CCTCCGCCTC TTGGGTTCAA GCAATTCTCC TGCCTCAGTC TCCCAAGTAG 
1037 51 CTGGGATTAC AGGCATGCAC CACCATGGCT GGCTAATTTT TGTATTTTTA 
103801 GT AG AG AC AG AGTTTCCCCA TGTTGGTCAG GCTGGTCTTG AGCTCCTGAC 
103851 CTCAAGCGAT CCACCCGCCT CGGCCTTCCA AAGCACTGGG ATTACAGGTG 
103901 TGAGGCACCG CGCTGGCCAA ATGATGGTGT TTTGATCTGG GTCTTAAAGG 
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103951 CAGAAGGAAG GGGGGTAGTA AATTAACTGT GCTGGGGAAG AGAGGGAGGC 
104001 CTGAGAGTGA GGAAAGAATG AGGGGTGATT CCAGGTTTAG GAAAACTGGG 
104 051 CAATTTGTTA GATGATGGTG CCATTGACAG AAATGGGAAA GAACAAGTTT 
104101 GGAAAGAAAA CTCAAGATCT GGCTGGTGAC TTGTATTAAA CTTAAAGCCT 
104151 CATTTGTGAC TTGAGCAGAA GTAAGGACTT TCTCCAGTGT TCAAGAGCTG 
104 201 GAAGGGATTT TTCTAGCCTC CAGGCAAGGT AATACCATAA GTCCCAACAG 
104251 TGATGCCCTC CCTGGGAATG ATCTCAATGG GAGAATCCTA TACCCTGCCT 
104 301 CCTCCATTCA TTCCTTGCTC TGATGGTGGT TCTGGCTGGC TAACCTAAGT 
104351 T AC TCTTGCC ACTAGTTAAC GCCTGTCCTT ATTTCTCTTG TCCCCACCTA 
104 401 AGATGTCAAT CAAAACAGCA CGAGCCATGC TATGTCACAT GACATGTTGT 
104 4 51 CTGTCCAGCC CAGAGCTTGT- TGCTGATGGG GGCACAGACT AGATTTTGAG 
104 501 AGAAATCTCT CTGTTACCAC CCTTAACATT CCAACCCCCT CTAATAGCCC 
104 551 ATTTAGGATT TATCATACTG TTTCATCCAA ACCTTTCATG ACCTGATTTC 
104 601 TATTTCCAGC TTCAACCACC CCTTGGGTCA CCACCTGTAC TTATTGAGTT 
104 651 TCCCTAGTTT TCTGAATTAA TGACTGAAGA TGATAAGCTT CCCTTACATA 
104 701 TGACTCTCAA ACCACCAAAC TGGGATTGTT GTTACTCTTA GTGATAATGG 
104 751 TTGCTATTTA TGAAACTTTT AATAGGGAAC ACAAACCCTG CCCAGAAATT 
104 801 CATATAAATT ATTTCATTTA AGAACATCAC AAAGTAGGTG CTATTATTTG 
104 851 ACCTTACACG -TGAGACTTGA AGAACTTTAG AGCATTGCCC AAGGTCACCC 
104 901 AGCTAGTGAG GGGTGGAGGC GGGATTTGAA TCCAGCTCAT CTGTCTCCAT 
104 951 TACCTGGAAG AAGGAAGGCC AGAGCATCAT GGCCTTTCAC AAGTTGAAGA 
105001 GCCACGGGCT TTCTACGGTA GCCAGCCACG CTTTTCCATG ACTGGGGTGG 
105051 GTGTGGCAAG TGATGAGGGT TTGGAGTTCA TGTGGTGGGG TGGCAGGGAC 
105101 CAGGTGTCTT GGTAACTGCT GTTGCATTCA CTTCAGGAGC AAAGGACCAG 
105151 ATCTGATTCT GCAGGATCAA CAATATGGAC ACTGCAGGCT CTGTAGACAT 
105201 CCAAAGCTCT AATGGTGACT TGGGGAAGCT CAGGAGGGCA GGGAGGTTGT 
105251 ACCCATTTAG AATGTAAAGA TTCCTATTTT ATAAAAAAGA AAAAAAGGAG 
105301 ACTGAAGGCC TCAGTCTCCT CCAACAAAGC CAGGCTGTGG GGTAGCAGAG 
105351 TCTCAAAGGG TGCAGGCCCA TGGCCACTGC CCAGGGCTCC TGCTCAGGCC 
105401 TCCTCACTCC CACAACTGAG GGGAGACCCA GTTCCACACC CACCCACCTA 
1054 51 GCACTGTCTC ACACCCACCG GGAGAGGTCT AAACATCTTC CCTGGGAAAT 
105501 GGTCCCAAAA TGTCCCTGCA GTAAGCAACC ATCTGGAGAG GCCCAGGTCT 
105551 AC AT CTGTTT TTAAAGCTCC AATAAATAAA TAAATGAAGG AAGAAAAAAA 
105601 GAAGAAGAAA TGCAGAACAG GGTGACTAAA ATTGGCATGT ATTTTTAAAT 
105651 GTTTATATTA ACAAACTAAC ACCTTTTAAC ATGAAAAGCA ATATAATTGT 
105701 GCTAGCCACA AAATCATCGT AGGACTGAGA AAGGAATCGT GATTCTGAGA 
105751 GCCCTAGAGT TAATGTGATC CAGCTGGCTC ATCCCTGTGA CTGCAGAAGC 
105801 CTGTTTGGAG ATAGTGTCAG TAGCTTTTCA GGCCCTCTGT GAATTGCCAG 
105851 AATGTGTGAC ATGAGCCAAA TTTCCCCCCA GCATCCCCGC CGCCGCCACC 
105901 ACCACCCCCG ACCCAACCCT CCCGCCGGCT CCCATAGAAT AGTCACTGCC 
105951 ATACAGAAAA AGAGAAGTTC TACTATTTCT GGGCAAGATT TCCACAAACC 
106001 AGTTTGTCCC TTTCTGCTTT CATGAAATAA ACCATTTGGA TCAACGTCAG 
106051 CTGATTGCAA AAATTTTCCC TTGTCTCAAA AGCAAGACTG ATAAGGAAGC 
10 6101 AAACATGGGA GGACCTTAGT GGCCGAGCCT TTATGTGTAT GTTATTTCAT 
106151 TGCTCTCATA ACTGCCCTGG GATGCTGTAA GCATGATTCA TCCTGTTTGT 
106201 TTATCAGTTA AATTATGTAT CCAAGATTAC ACAGCCTATC CAGGATTAGA 
106251 ACTCAGAGCC CTCGGCTGTG AAGCTTGAGC TCTTTCTTTT CAGTCTTCAA 
106301 ATATGATCAT GCCATGAAGC AGCACAAAGC CCAGGAGGAG CCCAGTGAGG 
106351 CTGGAGGGGT CCACTGGCAG CCACTCTCCT CCGTGCCCCT GTGGTGTTGG 
106401 GGCAAACTTG GATCTTTCTG AATCTTTTAA CTGTTTCCTT CTCTTCCCGT 
106451 TTTTGTCTGC TGGCTGACTT GTCCTACACT CTACTCCTTG CT TAT GAT AC 
106501 TTATTTTTCC ATCCACAGCA AAACAATTCA CATCAAGGTA ATTGATGATG 
106551 AGGCATATGA GAAAAACAAG AATTACTTCA TTGAGATGAT GGGCCCCCGC 
106601 ATGGTGGATA TGAGTTTTCA GAAAGGTGTA GTACCCTGTC CTCCACACTA 
106651 ACACTAACAT TCTTCTCTCC TCTTCTGTTT CTTCCTCTCC AACCCATTTG 
106701 TCTCCTCCTC CTCTTGTCTT CCACCTCTCT GGTTCCCTTT CCCTTGTCTC 
106751 CTCTCTTGCT CTCTCTCCTG CTCTCTTTTC ACTCCTCCCT CTCCTCTGTC 
106801 CTCTCTCTGC CCCCAGCTCT GTCCTAACAC CTGCCAGCCT GACACATGGC 
106851 ATCCATACGA GGGATGCTCA AGACCGATGG TAATTGTTCT GGGATAAGGA 
106901 AATGAGTATG GGGAAAGAAA GAGCCAAAAT GCTGGAGTAT CATGTGCGGC 
106951 TCTTGGCTTC TCCAGAATGG CTGGGCATAA AGGGGGGAAA AGGGACCACA 
107001 TAGCCCAGCA C C AG AC AG AA GAGCAGCACT GAGAAACAGG CTTTCAGCAC 
107051 AAATTTCCAT GGGGCAGTTA TTCTCAGGGC TAAACTTAGA GTCCCAGGAA 
107101 GTTGAGAATC AATGTATTTG GATTACAGTT CATTCCCCTC CCAAAAGCAG 
107151 GCTTTAGGAG CCACCTTATC TGCCATGTTG CTACTATCAA GACTTGTTTC 
107201 TCCTCCTGAC CTTGAGGAAG CTGAAAGTAC AGGTTTGAGT TCCAGATCTA 
107251 GGTCAAATAT CCATTTGTCT TCCTATGTTT TTCCTATTAA GAACACCCAG 
107301 GTGTGGAGGC AGAGAGTTAG AATAGTGGTG GAGATCATCC TGACCCAAAT 
107351 GGAAGCTTCC CCAAGAGGTC CATGGGGCTT CTCAGAGTGG ATGGAATCTT 
107401 TGCCTTCAAC TTCAATGACC CCATACATCC CATGGCCTCC AATAGACAAG 
107451 TCAAGAAGTC CTTTCCTGAA TAGATCATAC TGTGGAGCAG GGAGCTGCCA 
107501 GTACTGAGGG CAATGTTCCT TCCCCTTCCA AGCTGTCCCT CATGCCCTCC 
107551 AGTACATGCC TGTTGTCACA GAGCACCCCA ATCCCATCCC ACAGCAGAGT 
107 601 TCCTGCAGCA GAGAAACAGG CTCACACCTT GTAGACAGCC CTGGGGTCCC 
107 651 ATATCTAGGG CCAACAGAAA TATTCCCAAA AAAATGCCTC TTGACAATCA 
107701 ATGAGCTTTC TCTTTTGTCC GCTGAGCAAG GTATAAAAAG ATGTCAAAAG 
107751 AAGTACCCAA AAAGGTAATA AAAATGTACA GTCGTGCATC ACTTAGCAAT 
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107801 AAGGATACAT TCTGAGGAAG GTGTCCTTAA GCAATTTTGT CATCGTGGGA 
107851 AAATTATAGA GTGTACTTTC ACAAACCTAG ATGGTGTAGC CTACAACACA 

107 901 CCTGGACTAT GTGGGCCTAT TGCTCCTAGG CTACAAACCT GTACAGCATG 
107951 TGCTTGTACT GAATATTGCA GGCAACTGTA GCACAATGGT ATTTGTGTAT 
108001 CTAAACACAT CT AG AC AT AG AAAAGGCACA GTAAAAATAT CGTAGTATAT 
108051 AGCCTTATGG GACCACTATT GTAGATGTGG TCTGTCATTG AGCAAAACGT 
108101 TTTTATGTAG CATGTGACTG TACTTGTAAA GTACACACAC CACAAATGCA 
108151 CAGCAAGTCC TGTGCCCTAC AAGCCCCTTT GGGTCAGTCT ACTACATTAT 
108201 AAATGGCAAA GCCGAGCACG CCCACAGAAG GTAGCAGGAA CATCAGAGGA 
108251 TCTGAAGAGA CATTTAGGTA AATGCTCTTT ACCCTTTAGA GCATTTAGTT 
108301 CTTAGGCCTC CCCTCCCCCA ATCTCCCCCC CGCCCCCCGC CAAAAAGAAA 
108351 AAGAAAAAGA AAGCAGAAAA TTACAATTCT GGCTCACTAG TAGGACCTGC 
108401 TAGCCACCAT TGTGATTCCA TGAAGGACCA GAAGAAACCA TATAGGAAGA 
108451 ATCAGGCCCA CACGGCAACC TCTCCACATG ACAAAGAGCC AGTCTTTGGA 
108501 GGGCAGTGAA TTTCAAGGAA AGTTTTCTTC CCTGGGTGAC TTGTTTTTAA 
108551 AAGATGTTAT GTTTTGTTGA GATACCCAGA GATGAACAGA AACTTCCATC 

108 601 ACCTTGTGCC CCAGACCCAT GATAATTCAC ATTGAGGAAA CCAGTTTTGG 
108651 AACACATCAC CCCTAAGTGA TAGAAGCCCA AAGGTGATTT AGAAT TTG AT 
108701 GATTTACATC ATTTTCTTCA CATTTTCCCA GAAATGCATC AGCTGTAAAT 
108751 AGTAAAGGAT TCCTATGTAA TATTGTGGTT AATACATATT TATTTTAGTT 
108801 CCCACCACTG AAGCCCTATG AGATAAAGAA TGAGAAAGAT CACACAATTC 
108851 TACCTCCCTT TCTTCTCTCT CTCTCTCTCT TTCTCTTTCT CTCTCACTCT 
108901 CTCTCTCTCT CTCTCTTCTC TTCCTCTGTC TGGTTTTCCT TCCTCATAAA 
108951 TACTTTTCTT TTAAAATTTT CTTTCTGAAA CTCACAATGG AAGTGAGTAT 
109001 AGACATAAAG AAG GG AC AC A AGCCCTGGGT TCTGTTGACA TATTCCCTGT 
109051 TGTGGGAAGA CCCTGGGTTA TTCCCAGTGG GTTAGTAGTT TACCTGTTGC 
109101 CCAGAGAAAT GCCACTGTTA TCATGTGACA CCCAGTGGAA TGTGCTGCCT 
109151 GACTCACTTC CTACTAACTG TTGGCAAGGT CTAAAATGAC TCCTCCTCAC 
109201 CATTACCCGC CTTCTGCCTT CTCCTCCCCT TCTGTCCTTC TGGCTCCCTT 
109251 CCTTTGCCCA CCTTTCCTTG CCTCCTGGCT CCCTGCCCCC TCACCCGTAA 
109301 GAACAACTAT GACCAAGAAG ACAAGAAAAA CTAAGACCAT TTATTACCTG 
109351 AGAACAACAC AATCCACCAT GGTCCTGTTG AAAGCCACCA TGGTG GG ACT 
109401 GGACTGCATG TGCCAGGAAT GACGGGGAAT GATTTTAAAG GCTGTGCTCC 
109451 AGGTGACCAA CCAATCTACC GACCCAGTCG ACACACTCTC TCTCTTGTTG 
109501 TCCCTACAGG AAAACCATAA GGGTTAAAAT AGTAGATGAG GAGGAAT AC G 
109551 AAAGGCAAGA GAATTTCTTC ATTGCCCTTG GTGAACCGAA ATGGATGGAA 
109601 CGTGGAATAT CAGGTGTGAG ATTCTTTAAA AACAAAACAA CAAAAAAAAA 
109651 GAAAGAAAAA TTAAAACAAA CTGAAAAACA ACAACAAAAA AGAAAAAGCA 
109701 GCTATATTTT TGTCTCCCTC CTTTTCTTCC CTTCTCCTCC TTTCTCTTTT 
109751 TGACCAATGG ATTTTTTTAT TCTTTTCCCT CCTGTATTCT CGCTCTCACC 
109801 CTGTTTCGGT ATCATCTCTG CCTTCTTAGC CTTAGCTTAT TCCAAATTCC 
109851 TCCTTTACCG CCTTCTGGGC AGCACTGCAG CCTCAACTCC TCATTACCCT 
109901 AATGAGTTAT TTCCCTGTTT TGCTACAATT TTCAATTATT CAATTGCCAT 
109951 GGGCCCCTGC ACTCTCCCCC ACCCCACCCC TACACTGTAA CCTGTAAATG 
110001 TGAAAATTCC TTGGTGGGTG GGGAGGAGAA GAAAAAAAAG GAATGTGATG 
110051 CGATGCATGC CTGTGCCCCT TCCTGCCTTC CTCCCCTGCC ACCCCTCACT 
110101 CTTTAGCCTG GATTGAATGT GGGGGGGTCT GGGATGGGGG TTGGGGCCTG 
110151 GGTTGCAATG ATGCTTTGAC AGTTTTCTGC TGCATTCCCC AACTTCCTTT 
110201 GAACGCTTGG CAGGTTATTC ACTTGTGGAG TGGCCCATAG GCCCCTCTGC 
110251 CCTTCGAGGA GGTAAGTGTA TTTTCTGGCT GTTTCACAGT TGGGCAGACC 
110301 GTGGCATGGG AAAGTGTACC AATTGTCAGA AGCCACGGCT TCTGAGAGCT 
110351 CTGAGAGAGA GAGTTGACTT CTGGGGTAAT CATGCAATCT GGAATTCTGA 
110401 GCTATTCTTC CTCCTCTGGG CATCCCACCC CATGCCATTC TATGTTCCTA 
110 4 51 GCCCAAGGTT GGGTGCCTCA TTCAGGCTAC TTTGGGACAA TGCAACCTCT 
110501 AAAGCAGAAA ATTGAGAGTT CCTGAAGGGA AGGAAATAGT TCCAGGTATG 
110551 AAAATTCCCG TAGCCAGGGG CCCCAGAAAA GGACTGACAT TGGGCAGGCC 
110601 TGGAGTGTTG ACTTGTGGAT TTTCCAACAG AAGAGACTCT AAATGATGCA 
110651 GTTGGTGCTG ATCCCTGACA GACAGGTGTT GGAAAGGTCA CAGATGTCTG 
110701 CCTTTGCTTG GCATCTGCAA GAGAAAGTAC CGCCCAGATC CCAAGATAGC 
110751 CCTCATCCCA CACTAGAGAA GTGGCCTCAT CTCCTGCTTT CCTCAGGACC 
110801 TGCATCTGAG AATACCTGCC AGGGGCTCAT CCCTAAAGGA CTGATTATGT 
110851 TGCAACCAGG GTAGAAGTAA GGAAGGATTT CTTCCCTTGA AGAAAATGAT 
110901 TGGAAGCCAC TACTTTGAAT GGCTTCCAAT CATTTGGAGG CATAGATGTG 
110951 GGAATGGGTT AGGGTGCTCC TGGGAAATAA CAAGAGGACG TTCACACTCC 
111001 CATTCAGGAG AGATATGCTG CTGGGAGCCT CCTAGCAAAT GAAGCAGTGA 
111051 AATCCACCTG TTTGTCAAAA AGGGGTGATC ATACTGCAAT TAGTTCATAT 
111101 TCATGTGACA AAGAGCAGCA TAAAACTTTC CACACGAGGA CAGAGCTAAG 
111151 AGATTCAGCA ACAACATTCC CAAAGGATTC TCTACAGGCC TTCTCAGTGT 
111201 GATTGGTCAT TTCTCATTGT CTGCTGGGGA CTCTCCTGCA GAGCTGACCA 
111251 CTTCTGTGCC TGCGCTGGTT TGGACACACC TGATGCTCTA GGGGCAGAAC 
111301 TCCTCTCCTT CTTCACTGCT GGTTCTCTTC GTCACCACTC AATAAAACGT 
111351 TGCCCTCAGC CTGACTGCCA AAAAGTGCTG GAAGAAAGAA ATTATCTCTG 
111401 GTTCTATTGT TTCCCACATT GTATTCTTGC CCAACTTCCA GTTCTTGCCA 
111451 CCAACAATAT TCTCAGAGGT TGCCTCAGCA CCTGCCCTAC CTCATTCCCA 
111501 CCTCCCTTGA GCATTTATTC CATGTATTCA TAATTGGTTG GAAGCAGCAG 
111551 ATACCCAAGG CCAATTGTAA GTCACCTTCA TCAGTTTCCA CAGTCCAAGC 
111601 TACTTAGATG CAAACGAAAG CAGCACATGT ACAGCGTACA GGAAGGAAGG 
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111651 CAGTGGTTCC AGACAAGAGG AAGAGATTGG AAGTCCATAC ATGCCTTTAT 
111701 TCCACCAGTA AAAAGGCTCT TCTCTTATGC CTCCCTTAAA ACCTCTACCA 
1117 51 ACAGCAGGAC AG AG AG TG AC CCAAGATAAG TCTTCAAGAG ACCTAACCAA 
111801 ATGCAAATGT CTTTGGCTAA TCCCCATTTA AGGACATCTT CCTGTTTTGC 
111851 ACAGATTCTT TGCCCAAGGA AATGTCAGCA ATGCCCTCGT GGAGGGAGTA 
111901 GGTGAGAAGA CAAGGATTTC AGCAAGCTAT CTGTGTGGTG TGCCCCCAGA 
111951 TCTCCCCAGT GACCGAGATG CCAAGATGAA GAGTGCCAAG AAGAAATTGG 
112001 TCAATTTTCC AGCTGCCTAT TTTATTGTCT ATGTTTTCTA GGCGGTTAAT 
112051 TTCCAGTTTC TTCAGTACTT CCCGTATTTT GACATTAGAC CATAAGGTGA 
112101 AAGGTCATAA AACCTGATTG TCTAGACTCA GAAGCAAATG GAAACCCATC 
112151 CAAATTTCCA GAATTCCCTG CTGTTCTCAG AGTGAGAAAC AGAACAGTGG 
112201 AAATTGCTTT TCATTATCAC TACTGCATGG GAGAGTCTGA AACATTCAGA 
112251 ATGGCATAGT CTTTGCATGG TCAAAATGAC AATTGCATTA AAAAAATGAG 
112301 AGACTGGATT TGAAATAGGA GACTCTATTT TTGGCAAACA AAACAGACTT 
112351 CAGAGTTGAG ATTAAAAGCT CTGGATGAGC TGGGGGATGG AAAAAAGGGA 
112401 AGGAAAAAAG GGAGACTGAA TAGGAAACAC AGTTGCTCTG GAGTCTAGAA 
112451 GTGGACTTCC GAGAGCAACA CTGAGCAACA TAATCAAGAC TGTTGGGCCT 
112501 GGGCCTGGAC ATTGGAAGCC TTCGGATAGA AAGGAAAGCT CTCTGTCTCT 
112551 CTCTCTCTCT CTGAAGAATG GGGCCTGTTT GGTCCTCCTT TTTCGACAAC 
112601 CGTGGGCTCA TCTTGACAAG CTGCCCAGAT GCTTCCTAAT TACTCACAGT 
112651 CCTATGCTCT TTCCAGCTTG TCCCTGGGGT GTCTGAGCAG GAATAAATGA 
112701 CTCTCACCTG ACCCAGGGGA TCAATACAGG GGAAAGTTCA GCTCCAGCTT 
112751 CTCTCATGAG CAGCAGCAGG AAAAACACCC TCGAGGTATT GTGTCAGTCA 
112801 AAGCTGGCCT ACCCAGGTCT TGCTGACCCA TCTATAACTG CTGAGCAGAA 
112851 AGTCTTGGAT TCATGGAGAC AATGACCAGA GAATGATGGA ATTCCAGCCA 
112901 ACTGCAGGCC TTCTCACTAC TCTAGGGATG GGCCAGATGT TCGGTGGCAT 
112951 GTATGAGTGA AAACCAGGGC ATCAGGGACC TTTCTGGAAG AGCTGCCTTT 
113001 GTCTGACCCA CCTGTGTTCA TTTATGTGCT GGGATCTCTG ATCTCCCCTG 
113051 GAACTTGGGG GAAGCTCTTC CACGCAAACT CCCGGAAGGA GCAGAATAAA 
113101 CAAGCTCTTG CCTATCTATC TATCTATCTA TCTATCTATC TATCTATCTA 
113151 TCTATCTACC TATCTGCCTA TCTATATCTA TCTATCTCAA TGTAGTGAGG 
113201 AAAGCCATTG ATCCATTAAC CTTTGGAATT CTACATGGGA GATACCTAAA 
113251 AAAGTG A AC T GCCTTGTTTA TGTATCATGC AGACTCTGGA TCCACATATA 
113301 TCTCAGTGGC TGTGAATATA GGATGATTGA TCACAGGCCT GAGTTGCATT 
113351 CCTACAGATT CTTAGGAAAA AAATTG ATT C ACAGACATGT CCCCCCTGGT 
113401 TCCCCCACAA CACACACTCC TTCCTCAGCA ATCTCTATCA GTCACCAACT 
113451 ACACGTTGAA TATGTGGCAA GCTCTTCCCA GACCTTTATC TGAGAGCCAA 
113501 GGAGTGAGGG GCTGTACTAA GATATCATAG AAATGAAAAT GTGGTGTGTC 
113551 ACAAGTTTCC TTAATTCTTA GATCTTAAAC TCTAAGAGGG TTCAGCATAA 
113601 GTACAAATTC AAGGGCTAGA GACAACCTGT ATTGGGTGTG TCTTTAACTC 
113651 AGTTTCCCAA TCCACATAGG GACCTTGCAT TTGTCATCTC TCATCTATGT 
113701 ATAGCTGTTG GTATGACAGT TTCTCTGTTC CAGAATACCT GAACTCTGAC 
113751 TTAGCCTGTC CTTTCTGAAA CAGAAAAATC ACCCAACCAG AGATCTATGA 
113801 GATCTATGGA AAAGACAGTT GCC AAAAT AG ACAGCAAACA GCCAAACTTA 
113851 ATTGAACACT ACCACATGCA GGGACTTTGC TAAGCAGAGG TGATACAAAA 
113901 TGGGAGGAGC CCATAGCCCT AACTTCCAGG ATATATCTAC GGTAAAGACA 
113951 AACCATTCAA GGAAAACATT CTGCAGGACT TACCTTTTTG CTAAGTCATT 
114001 CTTTTAGGGG AAATCAAAGT TCTAGTCAAC GTGGCAGCTA GGAAGGCATT 
114 051 TGTGGTGATG GAAACCTTAT GAGCACTGAG AAGCTGAGCA TGAGTTCAGC 
114101 TAAGTCGTTA GGGATGGAAG ACATAGACCT GGGCACTGTT CCACTCTTGC 
114151 ACAATGCTAC CCATTTCCTT GAGCTCCCAT TCAAGCCCCA TGGTCATTTT 
114201 TGCCACTCAT AAGTTAGCTA CTCTGGCAGG GTTGCAACTT ACACAGTTTT 
114251 CATGATAACT GGATTCTCAC TCCTTTTTTT ACAGAATGGA TGTGATAACC 
114301 TGGTATCCTA CACAGTCATG AGTGACCAAC CTACCCATTT GGTTCCCCAT 
114351 CCTCATTCCT CCATTCCTAG CCCTAGGGTA GCCGGGAAAG CATAGGAGCA 
114 401 AATGCCCTTA CCAGGGCCCT GGTGCTCAGC AGCCTCTCCG GCTGCTCACA 
114451 CCTCTTGCTG CTGCTCTGTG CATGCTCCAA AGGCTGCTTT TTGCGTATGG 
114 501 CTGCTGAGCT CTCACCTACT AAGCTCTCTG CTTTCCTTAT GCTGCCAGCA 
114551 ACCACAAAAC CTGGTGATAC TTTCAAGATG GGACATTAAT GCTCTTTCCT 
114601 TTTCTTTCTT CCATTTTTCT GGTATCCATT TGCAAACAGC GCTCCTGTTA 
114651 TCTCCAGGTA AGAGGTGTCT TGTCCCCCTC TTTTCTTTCC ACTTCTTGCC 
114701 AGTGCCATTA TTTGGTTTAA GACCAATGTC CTTTGATTTA TTGAATAAGA 
114751 ACTGCAGGCT CAAGTTAACC TGACAATTTC TCCCAAGGAC TGGGAGATTT 
114801 ATTTTCCCAC ATGAAGCAAT TATGAGAAAG CAATTGTGAG GAAGGCAATT 
114851 CCTTGAGCAT CACTTCTGTC TGGGGACGTG GGTTAAGGCA TAGCTGATCC 
114901 TCTCTGGGAC CAGGAAGAGA AATTAAGCTT AACAAGGAGA TGGTGGGTCA 
114 951 TAGACTTCTC CTGAGTCTTA ATTCATCTGC CATCTCATGT TGTGGGGGAA 
115001 GAGACAGTGA GATTCAGAGC TGGAATCTCC TAATATAATT GTGACAGGAT 
115051 TTGAAAAAAA AATACTTTAA TCCCAAGGGA TCCAGGAAAT AACCAAACCT 
115101 GTTGTGAGAA TAGGAAATGC AATTTTTAAA GAATCTGGAA TTTTACCAGT 
115151 CCTGGAGATC TTCCATCTCA TCACAGCTGA GACTTAAATT GCTAGAATTT 
115201 TGGTTCATTT GTCATTGACC CTTAAAGTCC TATGTGCCGT GAACAAGATG 
115251 AATTAGGATG GGGGATTGGG GCAGTGTTCT GGCTGGAAAT ATAAATTTTA 
115301 GAGAATTTAT TTTGAAGAGA TTCTCATGCA GAATCTAGGT GCTATAGAGG 
115351 ACGTACACCT ACTTTGAGAG TATGCTTGCA TGAGTGGAAA CCAATCATAA 
115401 ACAACATTCA ACTTCATGAG CAGATATGAA AGCATTTTCA GCATATCTAG 
1154 51 CAATACTATA ACTCTTTGTG CAAGCAGAGT GGCCTACACA AGACAGTTTC 
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115501 AATATATTTT AAAAGAACGT CTTACATTTC ATCAGTCCTT TGAACACAGA 
115551 AAAAAATGTT AAGGCCACTT AAGAGGCAAA ACATCTTACA GAGTTCATTG 
115601 ATATTCAAAG TCACCTACAG GCTACATCTT GGGTTCAGGA AGGGGCGGTG 
115651 TACATAGTAA GGACATACGC CTTCTGGGAG CCTTAAACAA ACAAAAAAAA 
115701 TGTAGGTAAC TCCTACATTT TTCTTTTGTG GAAAAAACAC AGTTACTCCA 
115751 GCTTCCTTGG CTTTTTGCTT CTTTTTTATA CCAACAAAAT AAGGGCTATC 
115801 CTCAACCCTC TGTTCTTCAT TCTTCTCCCA GGGTATTGAT TTCATAACAT 
115851 TGGGTTTTTC TTCTCTACTT CACTCATCCT CTTGCCTGTG AAGGTATGTA 
115901 AGGCTTCTTT GTTCCAACTC TTTCCTCCAC CCGCCCCCCC TCACATAAAT 
115951 GCATAACAAA GATTGTGATT TAATTTAAGT TTCTTTCTAC TTTTAACATA 
116001 TTTGCAAACA TCAATAGAAG CTAAAATGGG AAAAAGGAAA TGTTTCTTTT 
116051 CCTAGCTCTT TCAATCTGTA AGCCTTTAAT TTAGGAGCGC TGATTAGCCT 
116101 TTCAATTCGT TGGAAATCTC AAATACTGGT TTTAATTTTC CTAGGTGGAC 
116151 AGAGACAGAG GGAATATGTT CATTCTGAGC TAACCACCCC CCCACCCCCA 
116201 AGCTCAGGCG CCTTGCAGGA AGAGCACTAG CTACATCACT CTGCAGAGTG 
116251 TTCACAACAT CCTATTCTTG TCTGGCCTGG CAAGCTCTTT GTCCTTCCAA 
116301 TATTTGTTCA ATCTTCCATC CTATTCATAT TCTATCTTTC TCTCCCCTCC 
116351 CAGCCTCTCT TCCTGTTCCT AGAACTGAGA GTTTATTTAG TCAGTCTGAA 
116401 TATCTAGATC ACCTGCCATT TATTCTCTTT ACTTGAAATT CTGAGGAGTC 
116451 ACATAAACAA GATATCAGAA TCACTATGGT CCTCTAAATT GAAGACTTAT 
116501 AATTCTCTCA AGAAATTAAC AACATTTGAA TTTAAAGGAA AGATCATGAC 
116551 AAAAATAGAA AAAGGCAGGA ATTATTGCCA AACCGAGAAA CTAGAAACTA 
116601 GAATTAACTT AAAGGCATGT GACTCAATCA ATTAACAAAT ATATACAGAG 
116651 AGCCTCTGTG GGACTGTGGG AGATCCAAAG ATAGAGGATT GGTTATTTGT 
116701 CAAAGGGATT T TT GC AG AAA GCTAGATGGA AAAACTGACT GTCACCACAG 

116751 AGGTGGACAG GTCAGTAAGT AGATCAATAT CCTGCCAGAT GGATATAGTG - <* 

116801 CT AG ATT GAT AGGT AG AC AA GGGGTTAGAC AGGTACATTT ATATGTCACT 

116851 GGAGAGCTCA TTATATTGGT ATAAAGTTAT TGTGTCACAT GTAAAGTATG 

116901 ACATGGGGGA ATTGGGGAGG AAGGAGTGGA ATAATACTGT CGCTGCTAAG 

116951 ATAGGCATTG TGATATGGTG CTTAAACCTG CAAGTAAAGG AAAAGAGTAT 

117001 GGAATCTGTG TGTCTTTTTC TAAGGGCTTT TTCCCAGAGT AGCTTGCAGT 

117051 CTGGCTTCTA GGGTTGCTGG CCTATAGCCA GAACCCTAGA TTCACCCAGA 

117101 TTTACCTTCA GAATTAACTA ATCAGAGACT CAAATTCAAT AGACTAAATG 

117151 AAGTCAGGCT GCTAGAGGAT GTCTGCTGAC TTGGACATAT GCAGAAAGAC 

117201 ATGGATCCTT GAGAAAACAT TGTTTCCAAA AGTGGCCACC AGC ACT AG AG 

117251 GAAGGACAGC ACCACGGACA GCTCCCAGAC ATTTTAGGAT TGCCTTCTGT 

117301 GTTTGGTGCC CGAACACTGA GC AAA AC AGC GAACTCAGGA AGTCTCCACA 

117351 CACTCTCATA CCATCTTCAT GCAGTCCAAC TAAGAAAATT CTTACATAAA 

117401 ATATAAGGCT GTCTGCTTGG TAATTTAAAC CCTTGGCTTA TAGTCTTTTC 

1174 51 AGTGAATTTC TTTCCTTGCA AACTCGAGAG TTGGAGTCTC ACGACTGCCC 

117501 TTGCTTCACC AATTCCCCAG CTAGAGACAA AAGACCTTCT TGGCCTCTGA 

117551 CCCATTTTGT CCTTGAGATT ATCCAAGGAC TACAGGATTC CCCTAGGAGG 

117601 TTTACTGTGT GGAATGAAAG CAATTAAGGA GCTGAATAAA AGAAATAATT 

117 651 GCATGTGAGA ATGTGGACTT GGATGGGAAG ATGTTTAAAT GAGCTCTGAA 

117701 AGAAACAAGC TGCCAAGAGC AATTTTCTAA TTAAAGGGGA ATAAAAAGAT 

117751 TCAATCTCTA TTTCACTCTA ATCCAGAAAA CATGTCTTCA TGGAGAAGTG 

117801 CTCTTAAAAT GGACTCATCA GCCAAAGTGG AAAAACAAAA AACAAAAAAA 

117851 CTGTTCAACA TGAGAAGGGA CCATTGGTAA ATGAGTCAAG ATGCTGTGAA 

117901 ACCAGTAGAC ATTTCCTTTG AATAAATGTA CTTCTGCACC TTCAAGAACT 

117951 CTTACAGGAA GTGGTTGAAC AAACAGGCCC AAAAGTTCAA AATAGTTCAA 

118001 GGTCAAAACA CTTGCCCTTT CTTCCCAGTT CCCCAACATC TCACTGAGTG 

118051 TCTTGAGAAC TTCACTTGAT GCTATTTCTC AGGAGATGTT TAGGTCAGGT 

118101 TGTCCACCCA GGTATAAAAG AGAAAGAGGA ACGCTTATCC CAGTCTGCAA 

118151 GGCACATTCT CATGGTCTGG TTATAAAGTG TTTAGTACTT CATAAAAAAG 

118201 GCACTAAAAA TATATATAAA CTCCCCATTC CCAAGAGTTA TTTGCTTTGT 

118251 ACCCACTGCC CATGCCTAAT ACTCTGAGCT GTATCCTTCC AGGGAATGGA 

118301 AAAGGTGTTA AAGCGAGTCT GATTTTGTTT TGTTGCAGAT GTGACAGACA 

118351 GGAAGCTGAC TATGGAAGAA GAGGAGGCCA AGAGGATAGC AGAGATGGGA 

118401 AAGCCAGTAT TGGGTGAACA CCCCAAACTA GAAGTCATCA TTGAAGAGTC 

118451 CTATGAGTTC AAGGTCAGGC AAACAGTGAG GTCTAATTGA ATAATAAATA 

118501 AATT AAAGTG GGAGGCAGAA GACCTGGGGT GTTTTTTTCC ACTTTCACTA 

118551 GTGAATATGT GAAGTTGAAA CTGAACAAAT CACTTACCCA CCCCAGGTCT 

118601 CAGTTTCCCC ATTTGTAACA TGAAACAAAT AGTGCTGACC ATTTGTATGC 

118651 TAGGAATATT GTTAGGAAAC ATAATATAGA ATGTGAAATA AGTGGACTAG 

118701 AAAGTCCTGA GATGTATTAT CATTATTGTT TAACTGTGTT TTTAAAGCAA 

118751 AAATATTAAA ACTCACTACT ACAGGGCAAG ATATATTAAC ATCATTATTA 

118801 TTATTCATTA TTGTATTATT CTAAATAGCC AATTTCAAAA GTCACAACCA 

118851 GGCCAGGCAG TGAGGGACTC ACGCCTGTAA TCTCAGCACT TTGAGAGGCC 

118901 GAGATGGAAG GGTCACTTAT ACCTAGGAAT TTGAGACCAG CCTGGGCAAC 

118951 AT AG GG AG AC TCCATCTCTA TAAAAAATAA AACAAAATAA AAATCAGCTC 

119001 AGTGTGGTTG TACATGCCTG TGGTCCCAGC TACTCAGGAG GCTGAGGTGG 

119051 GAGGATGGCT TGAGCCCAGG AGGTTGAGGT TGCAATGAGC CATGATTGCA 

119101 CCACTGCACT CCAGCCTGGG TG AC AAAGTG AGACCCTGTC TCAAACAAAA 

119151 CAAAACAAAA AGATTACAAC CAAAAACAAA GGGAAATAGA AGGATTGCCT 

119201 CAAAAGAGAT CGCCCAAGGC CATTCCATGC GTAACTGTCA GAACACCTTG 

119251 GAGACAGGGC ATCTTTCATT CCTTTGAAGA ACCAGACTCC TCATTGGTTC 

119301 TGAGCATTCT AACCTCATGG TTCCAAGTTT TTCTCTTCTT AACAGACTAC 
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119351 GGTGGACAAA CTGATCAAGA AGACAAACCT GGCCTTGGTT GTGGGGACCC 
1194 01 ATTCCTGGAG GGACCAGTTC ATGGAGGCCA TCACCGTCAG TGCAGGTGAG 
119451 AAGTGTCTCA GGCTGGCCTT GCTGGGAGAA GCAGGCAACC TCTGAGAAGG 
119501 AAGCGTAAAG CCACGTTAAC AGCCTGCCAG TCCCTAGGAA GGCTTGTGTG 
119551 TTCAGTCTTC CCAGCTCTGG TCCTAGGTGC CTGCTTGGAA AAGAATCATG 
119601 GCGTATCTGA AAAACATGGT TATCTCTGGT TTCAAATCGT TGTTCTGCTG 
119651 TGTGAACTGG AACAATGTAC CCTCTCTGAC CTCAATGTCC TCTTTCCAAA 
119701 GGGGAACTAT TGCTACCTTT CTCAGAAAAG TAGAAAGGTA CAGAGTCTTG 
119751 TATAAAATCC AAACTCAATA AATTCTGATT TCTGTCATTC TTTCTTTTCA 
119801 TGGGTTTGGT CCCGCTCTTC TGTAAAATGT GGGACAATTC TGATTTAGAG 
119851 ATGTGGGAGT TAGGAGTTTA TAAAATGTGT TGCATTGACT CTCCAACAAA 
119901 ACACTCTGGA TGATTCCATA CCCCTCCCTC GGCATTTACT GACAGGCTCC 
119951 CTCAGTAGTG ACCCACAGCA CAGCCGGGAG TCCTAGCAGC CTGAGGGGAC 
120001 TGCTGGTTGG AACAGGGACG GAAAAGGTCT CCCAACCACC ATCACTATCA 
120051 CCTCTCAGCA CCACTGAGGC CTCCTGGCCT TGTCTTTTAT TGAGAGACTT 
120101 TGTTGTCATA GCAACCCACA GGGTCATATC CCCAAGGCCC CAGAGCCAGA 
120151 GCAAAAAGAC AGCCAGGAAG AGAGGTTTGC TGCTGCTGCT GCTGCTGCTA 
120201 CCCCACTTTT CTCATCACCT GCTTTAGATC TTTCTAGCTC CCCCTCTGAT 
120251 GACCTGACTG TGCCCCTCAA GACAATAAAC GGAATGTAGG CCACATCATC 
120301 TACCCTGCTC CTTTTACAAA GGAGGGGACT GAGGTTCAGA AATAAGAGAT 
120351 GATTTACCCC AGCTTACAGA TTTTCTTCAT GGCAAAGCTG GAATGAGAAC 
120401 CCAAGTGTTC TGACTCCTGT TCTTTCAAAA CCCAGCTTCT ACCGGTTATG 
120451 CCAAAACATG ACAGAAGTTG CCGTTGGCAA GGCACAGGCA TGCCTCAGCA 
120501 TACCCTCCCC TCCAGGGCTG CTGAGTGGGC AACTCTGCCC ACATTTCCTG 
120551 GCAAGGACAA TCAAGGCCCA TCCTGCTTTT TCCCATGAGA TGTTTGGAGG 
120601 AGGGCACTGG CTCTGCAGTA TATTCTCGTG ATCTGGAATG ACAGCCATCC 
120651 CTCAGGGGAC AGATAATGAC CAGAACCACA ATGGTTATTG CAGCAGTCAG 
120701 GTCAGAAAAT TTGAGAGGAG CCCTGCTGGC ATCCAGTGAA GAGTGGCCAC 
120751 ACCGAACTGA TTTCACTTCT CTCCTTAGAC AACAAAATGC AGCCTGTGCA 
120801 TTCTCCTTTC TTTTTTTTTT TAATTATACT TTAAGTTCTG GGGTACATGT 
120851 GCAGAACATA GAGTTTTGTT ACATAGGTAT ACACGTGCCA TGGCGGTTTG 
120901 CTGCACCCAT CAACCCGTCA TCTACATTAG GTATTTCTCC TAATGCTATC 
120951 CCTCCCCTAT CCCTCACCCC TGACAGGCTC CAGTGTGTGA TGTTCCTCTC 
121001 CCTGTGTCCA TGTGTTCTCA TTGTTCAACT CCCACTTATG AGTGAGAACA 
121051 TGCAGTGTTT GGTTTTCTGT TCTTGTGTTA GTTTGCTGAG AATGATGGTT 
121101 TGCATCCTCC TTTCTTTCTG CTCCACTGTC TTGTCCCTCT TAATCTCCTT 
121151 CTTTCTTCTC TTCCTTATTC CCTGGCCCTC TCTCTCCCAC . TCTACCTTGG 
121201 TGCCCTGCAT TCAAATTGAC CTATGAGGCA GCCCAAATTG TTTCCCCACT 4 
121251 ATTTTCTGGC ACGCTGGCCC TGGCCCCCAC CAGCTGCCCA GAAGACAGCT 
121301 GGAGTCCCCT TCTAGCGGAT GATGCCTGTG GTGCGGGTTG GGCTTGACTT 
121351 TCTCATGAAT GATTATCTGA CTTCTTACCC GTTCTCTTGC CTGTTTATCT 
121401 TGCCTTCAGC AGGGGATGAG GATGAGGATG AATCCGGGGA GGAGAGGCTG 
121451 CCCTCCTGCT TTGACTACGT CATGCACTTC CTGACTGTCT TCTGGAAGGT 
121501 GCTGTTTGCC TGTGTGCCCC CCACAGAGTA CTGCCACGGC TGGGCCTGCT 
121551 TCGCCGTCTC CATCCTCATC ATTGGCATGC TCACCGCCAT CATTGGGGAC 
121601 CTGGCCTCGC ACTTCGGCTG CACCATTGGT • CTCAAAGATT CAGTCACAGC 
121651 TGTTGTTTTC GTGGCATTTG GCACCTCTGT CCCAGGTGAG AGTGAGAGGT 
121701 GCTTGAATTT GCAAAGAGGA TTTTACCTGG TTCAAATGAC CCCTGGACTC 
121751 CATCTCATTA TCTTCCACAC CATCTCAGAT CTGAACTTAA CAGAGCCTCT 
121801 GCCCTTAAAG TGCACAAAAG TCAATCAAAG AGATGAATAA TGACATTAGT 
121851 AATGACAGCT AATATTTCTT GAGCACTTTC AATGTGACAG ACACCATGTG 
121901 TGTTCAGCAA TTTACACATT TACATTTTCC CCCTGTAATG TTTCCCAAAG 
121951 CCCTATTAAA TAGGGTAAGT TATTATCCCC ACTTCACAGA CAAAGAAACT 
122001 GAGGCCCACA GAGGTTAAGC TACATGCCCA AGTAAGTGGT CCAATTTCTT 
122051 AACCTCCACA TTATGTGAGT AGACCACAAA CAGTGAAATT AAAAGAATGT 
122101 AGATATTGTT CTCCTTCTAT TTACCTCTGG CGATCTCTGA GAGGTTAAAG 
122151 ATTAGCCAGC TCAAAGATAT CAAAGGAGAA ATGCCCACAT ACATTCTTGG 
122201 CCTCCTCTAC TTGGAAGGAC ACTGTGAGTA CAAAGTATCT CCTAGCAGGA 
122251 CAGCCAAAGG AAGTTCCACA GCTTTTATCT TTTTATAGGA TGAATTACAT 
122301 ACTCTTTCTT TTTCTTAGGA ACACTCAGAG ACAAACAGAA AGGAGCGGAC 
122351 ATTCCTTTAC TCATTGAACA AATATTTACT GAGCACCTAT TATGCCTGTT 
122401 ACAGTATTGT GCTAGTTTTT GGGACTATAG TGAAAGGCAA GATACACATG 
122451 CTTCCTTCTC CACGTGGAGT TTATAATCTA CTGAAGGAGG CAACTCTCAA 
122501 CTACTGTAAT TAAAGTTATC TTGTTAAATC CTAGGAAGAA AAAGAAAAGG 
122551 TACTGCATAC GGAAGGAAGT TGGGCCTGAA TGTAGGAGTT AGCAGGTAGA 
122601 CAGGGGCTGC ACTAGCCCAG GTTCTTTACT TAATTCAGTT AGGGGCTTTG 
122651 GGGCCTCTGA ACTCTGAACT TCTGCCAGGG AGCTGGCATC CCAGTTGCCC 
122701 CAGAAAGAAA CAGAGCACAT CCTCCTGCAG GGAAGTTAGG CTGAATCTCA 
122751 TCAGACAGGA CTTTTCTGGC TGGGCCAAGG GAAATCTTTC CTGTACCAAG 
122801 CAAACATATC CTTCAAGAGA GTAGCTGAAT TCACATCAAA ' TTCTAGGAAA 
122851 ACCTCTTTCC AAAACCCCAG CGCAGGCCAG CGGTATTATT TGTCCATTAG 
122901 TGATGCAAGA GATTTAGCTA TCGTGGAAAT GCATCAGAAG GTTGGAAATT 
122951 AGATGGATGA TCCCAGGAAG GCCTGTGGAT GAGATGCCCT GTGATCTCTG 
123001 TTCTCCAAGC CTTGGGGGAC CTGAACTATC AGAGGGGAGG GAGGAAATAT 
123051 GGGGGAAAGC ATAGAGGTGG GAAGAAATAT CAGAGGATCA GAAGCAAAAA 
123101 ACAACAATAA CAACAGAAAC AAA A AC AAA C AAACAAACAA AAAAACAAGG 
123151 CCATAGGCAA GAAAGGGTAA GAGGTTTTCT CTGGGAGATC TAAAAAAAAT 
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123201 GGCAATAATG AGGTAAGCCA GGCAGATACC TTTGGGCATC TCCAAGTCCT 

123251 TGCAATTGGC CAAGACAACA GCTAACAACA TTTGAGGCTT TAAGAAGGTT 

123301 ACCCTGTGAT CCACTCATCT GATTTAGTGG CTTTGGCTGA AGCTCTTTGG 
123351 ATATAGTTGA AGGTACGGAA AGGGTCCTTA CATGAGGACT TTAGGGTCAA 

1234 01 GTCTCTTGCT AACATCCTAT GTGACCTTGG GTAAATTCTT TGACCCTTAT 

12 34 51 TTTTCTTACC TGTAAAATAA AAGAATTGGG CTAGATGTCT CTGACAGTCC 

123501 TCCCTGTATC TACAATCTGT GCCAAGATCT AAAGTCAAAC ACCCTGCAAG 
123551 GCCCTGTGAT ACATATATAA ACCACAAAGA CAGAGCCCCG TCTTCCTTGA 
123601 GTCCACAGTT CACCCTGCAT GTCCCCATCA TGGTTCCCCA ACATGTCCTC 
123651 TGTCCCCAAA ATCCAGCACC TCACCCAGTG CTCAATCAGT AGGCATTGCT 

123701 CAATAACTGT TGGTGGTTCG TGAATAAATG CCCCATATGA CAGTTAAAAT 

123751 CAGGCATCTA CTCCAAGCAG CTTCCCAGGG TGTCAAGGTT CCCTGGGGAG 

123801 ATATTATGGG ATGGCAAACT TCCCTTACTG AAAAAGTAGT CAAAGGAGAA 

123851 CAATAAGCCC ACTCAGTAAA TATCAGAACT GGAAAGCCCT TCAGAATCTT 

123901 TCAGATCACT GCAGATGAGG AATGGGAAGC JCCAGACTAGG GATGTGACCT 

123951 ACCCAGGGCC ACACGGCTTG CTTGCGGCAG AACTAGGAGT TAGGAGTGGC 

12 4 001 CCCCTAGCCC TTGTCTCTCA TTCCTGGGTT CAGCCCACCA GCTCAAGCTG 

124051 CTTTTTGGGC ATACTGGAAG ACAAGCCCTG CACACCTTAG CCTCCTACCA 

12 4101 GTTCCCATGT GTCTTTGTCC TTTTCCAGAT ACGTTTGCCA GCAAAGCTGC 

124151 TGCCCTCCAG GATGTATATG CAGACGCCTC CATTGGCAAC GTGACGGGCA 
12 4201 GCAACGCCGT CAATGTCTTC CTGGGCATCG GCCTGGCCTG GTCCGTGGCC 

124 251 GCCATCTACT GGGCTCTGCA GGGACAGGAG TTCCACGTGT CGGCCGGCAC 

124 301 ACTGGCCTTC TCCGTCACCC TCTTCACCAT CTTTGCATTT GTCTGCATCA 

124351 GCGTGCTCTT GTACCGAAGG CGGCCGCACC TGGGAGGGGA GCTTGGTGGC 

12 4 401 CCCCGTGGCT GCAAGCTCGC CACAACATGG CTCTTTGTGA GCCTGTGGCT 

12 4 4 51 CCTCTACATA CTCTTTGCCA CACTAGAGGC CTATTGCTAC ATCAAGGGGT 

124 501 TCTAAGCCAC ACAACAGAGC CTCCAGCAGG GCAGGCCTAG GACTTCTCCT 

124 551 AAGAGAAGGG CACTTCCCCA CCAGTGATCT CTCCCGACTG CACTGCCCTG 

124 601 GAGAGGCAGC ATCAGGACCT AAGCCCCAGG AACTTCACCC AACTTAGGCC 

12 4 651 CTGGCAATTA ACTGAAAGGG CAAAGTCTTA ATCAATCAAA CAATGGAGGA 

12 4 701 ATCACCGACT TTACACAGTA TTTAATTGAA TACAAACAAG CAACAGCAAC 

124751 AAATCCACCT CCACCCCATC TCCCCCTCAT ATCCCTGACC CAAAGCAAAG 

124801 GTCAGAGCCT TTCGCCTCCT TCTATTCCAT CTTTTGATTA TTCCTTTGCC 

12 4851 TCTCATTTCT TTGGAAGCAG GGTTTCTCCT "GTCTGCCCAA T2CCATATGT 

12 4 901 CCCTATTATC TCACTCAGCT GACAAGACGT GAAAATGAGT CACATTCATG 

12 4951 TGGCTGGGGT GGGGTTCTTT TTTCATTGTA ATCATTATTG TGGTTGCTTT 

12 5001 CGTTTTGCCG TTAGGTTTTG CTTATTATTT TGTTTTGTCT TTTTTTTCTG 

12 5051 AAGTGAGTGA AAAAGGTGCC ACAAAGGAAT TCCAGGTCCG AGCCAACAGA 

125101 GAGAAACATG AATTTTTAGA CACATGCTCT CCTGCCACCT CTTGGCTCCA 

125151 TCAAGATCCA GTTCCCCATC TCACTGTTTT CTCTGAGTTC TTGGGAGGAG 

12 5201 TGATGGTGTT GGGGTAGAAA TAAGCTCACT CACCCACGCA GGGTACTAAA 

125251 GATCTTACAG GAGCTTCAAC TGGAGCAGGA GGAGCTTTTT ATGCTTATGT 

12 5301 TGAATCAAGT CAGATACAAA AAGCAATTGT CCCTCTTTGC CCAAGCCTTT 

12 5351 CCAATTCTGT GTGTCTTGTT GTGTCAGTGT CCACTTGTGT ATCCTTCTGC 

12 54 01 AGGAAGACCC GCCAAATAGA AGAGATGGGA CAAAAATAGG AATGGTGTGT 

1254 51 GACGACAAAG GGCTACTGGA AGAACAAAAG GGATACAGGC CTTCTTGATT 

125501 ATCTTTGGCT TTGTACCTGA GGCAGGAGAG AAGAGATGTC CAACCAGTGA 

125551 GATCTTTAAG AGAAAAGTTT GTATTTTAAA TGTCAATGTG CCTGAGAAAT 

125601 GTCAGCTTCA CCACGCTCTT GCTTCCTAAT GCTCTATACA AAGAGGGCTG 

125651 ACTATATTTC TTGAAGTGGT GTAAAAACTT AGAGATTTTA TAAGAGAACC 

125701 AGGGGCTCCC TTCACCTCTC CTGGTCCCTC AGGTCACATA TGAAAGCATT 

125751 TTTACAAGAT AGGAACTGGA ATTCCTCATT TCTCCCATGT TCCTGCTTGT 

125801 TCTTAAACTT CATGAAGCTA TTTTTCCAGC CTATGGGGTA GTTCTTGCTC 

125851 CAGTAAGAGG AATCTTAGTT GTCATAATCC CTTGGAGCCT GGGTTTTTGG 

125901 AGAAAGAGAT CTCCGTGCCC TACAGACCTT TTCTCAACGA ATGTGGGAAG 

125951 GACCTGGCTT TAAAACACGC ACACAAACAC ACAAATAAAC AGACATAAGA 

12 6001 TGTCATCACG AAACTGCCCA CGGATCTTTA GGCTTTCTGC ATTGACATAA 
126051 ATACATTTTC TAAGGGGGGG GGGGAAGAAA TTAAAAAACA CCTGTTAATT 

126101 TTAAACACAT TTTTTAAGAA AAAAATAATT AAAAAAGAAA CAGTGCTCAT 

12 6151 GTCATAAGCT ATGTTGACAG TTGCCAGTGG AAATGTTGGG TTGGTTCAAA 

126201 AAAAAAATAA AAGCTATACT ATATCTCTCT ACATACAGCT TGCTTCTACC 

126251 TGTGTTTCTT CAGTGAAAGG TCCAGGGGGC CACTGTGGGC TTCTTGTGAG 

126301 GAGACGTGAC TCAGGTGAAG GTGTCACCTC CTCTCACACT CAGGTGCCAA 
126351 TGTGTCAGAC CCAGTATATT CTAAGCAAAA ATACTTCAGG AAAATGCCAC 

126401 TTGTCAAAAC CTGGACTTTG CGAAGTTGGA AGATGTAAGT AGTAGTAAAA 

1264 51 GCTGTGGTAA TTATGGAGGA AGGAGGTTTC TGTATCAGAA AGGCATTGGC 
126501 CGTGACAGAC TC 
(SEQ ID N0:3) 
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378 



742 



2005 



2381 



5165 



5402 



6794 



TGGCATGTACAAAGGTCCTGGGGTGGACAGTCACTTGGTATAATCCAAGAGTGAACCTGA 

AGGCTATTGTTGTTGAAATGTAATAAGGGAGAGAGTGACGGGATGAAGGGGGATGAGTGG 

GAAGCAGTGAATTCCTGCAAGGCTTTGAAGGTCATGGGAAAGAATTTGGTCTTTATATCA 

AGAGCAAGAGAAGACTACTAAAGGGCTTCAAACAGGGGAGCGATATGCTTAAGTCTGTTT 

GTTTGTTTTTTTAAAAAAAGATTACGGTGGCTATATGAGGAAAGTGGAATTGAGAACTAG 
[C,TJ 

GAGAGTTGGAGTGGTGAGCTCCATTAGGAGGCTACTGAAGTAGATTCATGAGGTAAGGAG 
TGATGGTGGCCTGGGCTGGGATGATGGTGGTAGAAATGGAGAAAGAGTTGATAGGATTTA 
GTGATTGGATAAGGGACAGAAGAGAGATGAAGGCTTTCAGACTAACATCTGCTTTCTAAC 
ATGAGTAACTGGGTGGCTGAAGATGCTATTTTCTGAGCTGGGAAACAGGAGAAAAAGGAG 
CAAATATGGGGGATGAAGACTTTGAGTCTTTAAGGTGCTGTACAAACACAAATCAGCATT 

TG GTGGC CT GGGCTGGG AT G ATGGT GGT AG AAATGG AG AAAG AGTTG AT AGG ATT T AGTG 
ATTGGATAAGGGACAGAAGAGAGATGAAGGCTTTCAGACTAACATCTGCTTTCTAACATG 
AGTAACTGGCTGGCTGAAGATGCTATTTTCTGAGCTGGGAAACAGGAGAAAAAGGAGCAA 
ATATGGGGGATGAAGACTTTGAGTCTTTAAGGTGCTGTACAAACACAAATCAGCATTCCT 
TTATTACTAAGGGTATCCCACACAGTTGTAGCAGAGGGAGAAAGATCGCCCCCCCCCCAC 

TTTTTTTTTTTTTTAGCTATTCCATGGTATTTTCATTCTCATCCCACCCAAATGAGGCAG 
TGAGTGCTAAGATCAvTTATATAATAGTTTCAATTGCATTTCATCCCATTCTTCTGAGCTC 
AAGCTCACC T T ? t ACTGGTTTGAGGCCAGTAGATGAAGCTGCATATCACCCCCAAAATCT 
TGTCTi . AG TTT AAC AAAACTT ATTTGAGAGACATTTGCATGTTTTATTAATAATGATTT 
TTACCATTTCTT-rrTTTCCATCTTTGGGTTTGAAATTTGAGTGGCTGGCGGATGATCATC 



TTTCCAT»*'."f.'v;A.:T ATTCCC AGCTATTTCAAGCCATTTTTCAACGGAGTCTCCACCAGAT 
GGTTTGCIAv"^*»ACA-..A^ ACTTATTTCTGCCTCCCATTGACATCTATTTTTCCAAGTGAGA 
GACTGCCCCA TAT -T- > AOTw,.AATATGTCACTGGAGGTGAAGCATCAGTTGTATTGGTGG 

gaacctgc c tt , - ,r~: c : ; - ttt rrcc tcatgccttttcc tgcc tctctgatc ttttc 

TAGGTCTCT^JVTAT A^~ -^ACAACTGGTGCTGCAATAGAAGCCAGTGGCTAAGTCT 
fC,T] 

GTGTATGGC C T S;.T 7 AA3C7T < V.' A.VZ CTCTCACCTCTGCCTTCCTCC ATTTTGGGCTGGT 
TACCTTTCTGCTTTT-VTOAAT;- •T'JTTCGAGCAGAGGCTGGTGGCTCAGGGGACGTGCC 
AAGCAC AGGC iC A« . AA ~ AATCVA07CC7CTTCAGGGTCATCGGACTGCAAGGAGGGTGTCAT 
CCTGCCAATCTt^.iTA '."iV-A ~*AACCCTTCCCTTGGGGACAAGATTGCCAGGGTCATTGT 
CT ATTTTGT G^" C CT S A7 A T A : A T ^ TTCCTTGGGGTGTCC ATC ATTGCTGACCGCTTC AT 

CCTGAATGCTCTTtCAOZAOAV. -CTGGTGGCTCAGGGGACGTGCCAAGCACAGGGCAGAA 
CAATGAGTCCTC TT JAoSTTC ATCGGACTGCAAGGAGGGTGTCATCCTGCCAATCTGGTA 



GATAT AC ATCTT C CTT kTGGTGTCCATCATTGCTGACCGCTTCATGGCATCTATTGAAGT 

CATCACCTCTCAAGAGAGGCAGGTGACAATTAAGAAACCCAATGGAGAAACCAGCACAAC 
[A,C] 

ACTATTCGGGTCTO;AAT5AAACTGTCTCCAACCTGACCCTTATGGCCCTGGGTTCCTCT 
GCTCCTGAGATACTCCTCTCTTTAATTGAGGTGTGTGGTCATGGGTTCATTGCTGGTGAT 
CTGGGACCTTCTACCATTCTAGGGAGTGCAGCCTTCAACATGTTCATCATCATTGGCATC 
TGTGTCTACGTGATCrCAGACGGAGAGACTCGCAAGATCAAGCATCTACGAGTCTTCTTC 
ATCACCGCTGCTT'oGAGTATCTTTGCCTACATCTGGCTCTATATGATTCTGGCAGTCTTC 

TTCCTCTGAATGACTGAACATATCCACAAATAATAAGCGTGGCAGGAGATGGTGTGAAGA 

GTAAAAGGAGCATATAGGAAGTTGTGTGTGTGGGGTGTCTGTTTCAAGAACCTGCTAATT 

ATACCTTCAGTAAGAAATGAAGCCATACAACCTCTAGAAGAGGAGGAGGAAGGAACTCAT 

GGAAAAGTGGGGAGCCATAGAAGCTAGGGAGAGGTGTCCTAGGAGTGCTTCTGCCCAGGT 

CCAGCCATGAGACAGAGCTCAAAAAGAGCTGGGCACTGCTGGTGACAGAACTGAGTGACC 
(C,T] 

GGGGGATCCTGCATCTGTTCTTACTCAATCCCTTCTTAATAATGTGACTTGGGGCAGGTC 
ATTTATTGGTTCTGGAACTTAACTTTCTGATATGCAAACTGGGAATAACAATACTTTCCT 
TGCCTGGAGGCAAGGTCAGTCCTTTTTGCAGTTCCTTCCAGCTCTAAGATTTTCTGAACC 
ATAGACATAAGCACTCAGTGTAGGTCATATTCGCACTTGCCAAAAATGGATCAGGGAATA 
TTGTCTCCTGAAGGGAAATGGCCATTGACAAATTGATTTATTAGAGCTCTGTTTAGTCAT 

GGTCCAGCCATGAGACAGAGCTCAAAAAGAGCTGGGCACTGCTGGTGACAGAACTGAGTG 

ACCCGGGGGATCCTGCATCTGTTCTTACTCAATCCCTTCTTAATAATGTGACTTGGGGCA 

GGTCATTTATTGGTTCTGGAACTTAACTTTCTGATATGCAAACTGGGAATAACAATACTT 

TCCTTGCCTGGAGGCAAGGTCAGTCCTTTTTGCAGTTCCTTCCAGCTCTAAGATTTTCTG 

AACCATAGACATAAGCACTCAGTGTAGGTCATATTCGCACTTGCCAAAAATGGATCAGGG 
[ A, G) 

ATATTGTCTCCTGAAGGGAAATGGCCATTGACAAATTGATTTATTAGAGCTCTGTTTAGT 
CATTTTGCTGGGAAGGATAATCATTTGTTAACGTAAGTAGAAACCTGTGCCTTCTGGAGA 
ATACTATCCATTTATATGTACTCTGGGGAGAGTGTTTATACATACAAATGAAGGACAGGG 
CTTCACTGGGAAAACAAACTCCATGGAATTTCACATGATTATCGCGATGTCAGTGTGGAA 
GAAGAT ATGGT AAGGC ATT AAATGAC ATT AAGACCACAAAATTTGCCATAATTTGACGGA 

CTCATAAAATATTAGAGCTAGAAAGGACCTTAGAATATCTTCTGCAGTCATGGTTCTTAA 
ATTTTAATGTGTTGCTCAATCATCCAGGGATCTCACTGAAGGGCAGATTAGGATCCAGGA 
GGTCTAGGGGAGGGATTGAGATTCCGCATTTCTAACAAGTTCTGGATGCTGCGGGCCCCA 
ACTTAGAGGTGAAAGGTTCTGAAGCTCTTGACCAAACCAGGAGACCCAGCAAAGAAGTGG 
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TTTTTCAGACAACTTGCTTAATTGAATAATGATTGTTTGCTCTTTAATTCCAACTTTCAA 
[T,CJ 

GCCAATTTAGCAAGAACCAGAGGCTGTGCTAATTGCCACACCAGTCTGGAAACCGAAATG 
G AT AG CTTC AGG GT AC TTG G AC AAAGTTGG AACATCTGCTT TCT AATC TCTCC CTCTT TG 
TATAGCTTTATTTGCCTACCAAGCCTGGTAGTATTGAAAATCTGCCCTCACTATACTCCC 
CTAAATATAATCAAGTTGAGGCCAGGCCTGTGCTCTATCAATAATATAGGATCCACGAAT 
TCACATGTTTGGTTTTATGCTTTACTTCTTCAAAGGTGCTTTTAGCAGCATGGAAGAATG 

988 3 GTC AAAG AATATGTCAAAGCATGAC ATATTCCAACTCC AGGATCCATAAAACACCCCAAG 

TTCTGTGGAGACCCTATCACATCTGCAAAACTCTCCAGGAAGTCCAGAGCCCTCCTGGTT 
AATTTGTTTTAGGGACTAGGCATGCGGTATCCCCTGACAACACTGGATCAGCAATTCTCC 
TACCTAAGTCAGTCCCACACCATGTGCAGCAGAGTATCCAGTGCCCCTGCCCTGGTCTGC 
TCACATTGGTTTGCTCTCCAGAATAATAATTCCTCAATATCCACAAGAGATTGATTCCAG 
IA,G] 

ACTACTCCGAGGATACCAAAAATCCTCAGATGCTCAAGTACCTGGTATAAAATGGCACAG 
TATTTGGCATATGACCTAGGCATATTCTCTCCCATATACTTTATTTATTTATTTATTTCG 
GCACAGAATCTCATTCTGTCGCCCAGGCTGTCACTCGCTTATTGCAACCTCTGCCTCCCA 
GGTTC AAG C AATTCTC C T GCCT C AGCCTCC T AAGT AG CTGGG ACT AC AG ACGC ATGTC AC 
CACGCCTGGCTACTTTTTGTATTTTTAGTAGAGACAGAGTTTCACCATGTTGGCCAGGCT 

1 Or i 0 C AGATGCTCAAGTACCTGGTATAAAATGGCACAGTATTTGGCATATGACCTAGGCATATT 

CTCTCCCATATACTTTATTTATTTATTTATTTCGGGACAGAATCTCATTCTGTCGCCCAG 
GCTGTCACTCGCTTATTGCAACCTCTGCCTCCCAGGTTCAAGCAATTCTCCTGCCTCAGC 
CTCCTAAGTAGCTGGGACTACAGACGCATGTCACCACGCCTGGCTACTTTTTGTATTTTT 
ACTAGAGACAGAGTTTCACCATGTTGGCCAGGCTGGTCTCAAACACCTGACCTCAAGTGA 
IT, C| 

CCGCCCACCTTGGCCTCCCAAAAAGCTGGGATTACAGGCGTGAGCTACCACGTCCAGCCC 
CCCATATACTTTAAATCATCTCTAGATTACTTATAATACCTAATACAATGTAAATGTTAT 
ATAGTTGTTTTAATGTATTGCTTTTTTTATTTGTATTGTTTTTTATTGCTGTATTATCCT 
TTTTTATGTTTTATTTTTTCAAATATTTTCTACCCGTGGCACCCACAGTTGGTTGGTGGA 
ACCTGCGGTTGGTGGAGCCCATGGATGTGAAGGGCTGATAGTATGAGAAAACTCAGAGGT 

122 20 ACATCCAAATAGTAACTTAATATTCCAAATATGGCTGCAAAACAAATTGTCGATTATGGA 
TGACTACTACTGCCATCTCTCCATACCAGTCCATCTTCTGCCAGGCTGTTTGGTCTTGAT 
TTGTCGACCTTTTAGGTTTCTCCCCATGTATTCCACATGACCTTCACCAACCCCACTTCT 
ATCTCCAAACGTCTTTCTGAGTTGTGGGGATGCAGATGTATTCTGCCACCATCACAAGGG 
CTAACCGAGCCCTGGCTGCGGATCTTCATTGTTGTTCACATTATTTCCATTCTTACACCC 
(T,G1 

ACTTCATGTTTGTACACTATTTTCTTACATTTGCTGTCTCTTCTAAACATTCTTTGCTGC 
ATCCACTTTTTCTCTATTTGTGCTCTAGGTGCTGCAGAGGCTAATGCTGGGTTTCCTTTC 
ATTCCTCCTTGCACTCAGCACCTCCCTTCTCAATTCCTTTTGCCATGTCTCCACTTTAAA 
TCTTAACCTACTCCAGATAGTCTTTTCCTTCACACTATTGGCATCTGTGCTTGGGTTGCT 
TTCAGTCTATTCTCTGATCTATGATTTCTTTGCATGATCAAGAAGGTGCCATGAAAGGAT 

13842 TCACTTTCAAAGCCTCTTTCTGGGTTTGGATTTCCAGAGCAGCCTGTGCTGTAAAGCAAG 
ACAGAAAGCTTCCCTGCCATTCATGCCTGCCAGGGATAGAATGACAGTACTCCTGAGGCT 
CTCCCTCCCCACCCCTCCCCTGCTGGACAGCTGATCTGCTGGACTCAGCCAGAGCCAGCA 
GGCACCCCCTCTTTATCCTAGGAGCTGCAAACTTGATGCCTTTCCAGGAAATCCCCAGAA 
GCTGGAGTATCCTCATCTACATGTGGCACAGTGTATGGTTGTGTCAGGTGCTCATGTCCC 
(G,A) 

TTGCATAGGACTGGGGTGGAAAATAGGGACCGTCCTTTTGTGTCAGCTCCAGTCAATGAG 
TAGTGGCCATCCAGGGGGCCATCTTGGAAAGGACTTGTGAGGCTGTATCTGCGCTCAGTT 
GTAGATGTGAGAAGAAAAGGCCAAATATCTGCCAATCCTAGTCCTGGGATTCAAGATAGA 
AAGAACTGCATGGAGTGAAGAAACTAGGAGTCTCCATTTCACTGAGATGCATAAGAATGA 
AATTATTGTCACTATTTCTTCAATACTGGGCCAATCCTAATAAGAAAACCCTTTTTGAGT 

14200 GAGT AGTGGCC ATCC AGGGGGCCATCTTGGAAAGGACTTGTGAGGCTGTATCTGCGCTCA 

GTTGTAGATGTGAGAAGAAAAGGCCAAATATCTGCCAATCCTAGTCCTGGGATTCAAGAT 
AGAAAGAACTGCATGGAGTGAAGAAACTAGGAGTCTCCATTTCACTGAGATGCATAAGAA 
TGAAATTATTGTCACTATTTCTTCAATACTGGGCCAATCCTAATAAGAAAACCCTTTTTG 
AGTCTCTCTTTTCTTTATCCTACATATAACACAGAAGCTTTTTCTATTCCCTGGATGAAC 
CC,A] 

CACAGGGACAGAAATTCTTGTTGGACAGGTGAAGCAGATAATTTCTTTATCAGACTAGAA 
TCTTCCAGAAGCACTGCTAACCTAGTGAGTTTTGTACTCTAGACAGGTGGTTCTCAAGCC 
AGCTCCCCACCGCAGGCCTTTTTCATGGTCTGCCCCTCCCTGTGGAACCCATGTTTTAGG 
TTATTAGCTGATAATTGGATTTCTATTTTTTCTCATAAAATACAGCAAAAGATAGCTAGT 
GATATTATGATGAGTTAATGTAATTATAGCCAAAGCAGAGAGAAACAACATTTTAATTAA 

15878 TGTGTCAAAATATCACTCTGTATCCATACATATGTATAATTATTATGTGTCAACTAAAAA 
TAAAAGGAAAAAAATCATTTCAGTGTATTTACAAAACATATGTAACCATTAAGAATAATG 
T T TT A AATT AT ATC T AAGGGTGTG AT AAAATT AC AGT AT AAG ATTGTGCTTG AAAAAGTG 
C AAT A AG AAGT AA AT ATGT AC AG AT G AG AAAAAGT GC AAAG AACT AAGTCCTAAGCAG AC 
TATACCTTTCCTACTGCATGGTACTTCTCTGGCCTTTTGCTTTGAAAGATTTTGCACCCA 
(G, T) 

CATGGCAAGTGGTTAGCAGAGGCAGCCATTCTCACTTGTGCGTTGGCTTTGGGAGCCATA 
TATGTTGTTCAGCTGGGTGTGGAGTGGAAAGGCTGCATGTTGTATTAATGCATTGTTAAG 
AACCTCTAAGAGTGATTTCTTTTGGGAAGTGAGACTGACGGTCCGAATGGTGGAAAGACA 
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ACTTTTAATCTTTTACTTTACACTTTGTGCACTTTTAAATGTTTAACATGAGCATGCATT 
TCTTTAATAATAAAAATACAAAAAAATTTTAGCCCTAGATCTTCTGATTTTAAACTGCAT 

16030 ACAGTATAAGATTGTGCTTGAAAAAGTGCAATAAGAAGTAAATATGTACAGATGAGAAAA 
AGTGCAAAGAACTAAGTCCTAAGCAGACTATACCTTTCCTACTGCATGGTACTTCTCTGG 
CCTTTTGCTTTGAAAGATTTTGCACCCAGCATGGCAAGTGGTTAGCAGAGGCAGCCATTC 
TCACTTGTGCGTTGGCTTTGGGAGCCATATATGTTGTTCAGCTGGGTGTGGAGTGGAAAG 
GCTGCATGTTGTATTAATGCATTGTTAAGAACCTCTAAGAGTGATTTCTTTTGGGAAGTG 
[A,G] 

GACTGACGGTCCGAATGGTGGAAAGACAACTTTTAATCTTTTACTTTACACTTTGTGCAC 
TTTTAAATGTTTAACATGAGCATGCATTTCTTTAATAATAAAAATACAAAAAAATTTTAG 
CCCTAGATCTTCTGATTTTAAACTGCATATTCTTTCTATTGTGTTACATATTTTAGCATG 
AGAATAAGGTTATGAAGCTGGAAGTAGCAGGCTCCCTTTTCCTCATATGTAGGAAGTTAA 
G AATGC AT TC T AC GT TTC TTC T TT AAGG AGTTGGCT TCTTTCCTTTT AAC AT AGGGGT AA 

16292 TGTTAAGAACCTCTAAGAGTGATTTCTTTTGGGAAGTGAGACTGACGGTCCGAATGGTGG 
AAAGACAACTTTTAATCTTTTACTTTACACTTTGTGCACTTTTAAATGTTTAACATGAGC 
AT GC ATTTC T TT AAT AAT AAAAAT AC AAAAAAATT TT AGCCC T AG ATCTTCTG ATTTT AA 
ACTGCATATTCTTTCTATTGTGTTACATATTTTAGCATGAGAATAAGGTTATGAAGCTGG 
AAGTAGCAGGCTCCCTTTTCCTCATATGTAGGAAGTTAAGAATGCATTCTACGTTTCTTC 
[T,CJ 

TTAAGGAGTTGGCTTCTTTCCTTTTAACATAGGGGTAACTGGGCCCAGGGAGTTTGGCAA 
GGGCCAAATAAAGTCCTTAATGCCCAGCTCAGAAATCTGGATTCACCATCCTTGACTGCT 
GGCTCCAACCCACCCTCACCTGAGCTGGTCTGCAGAGGATTCTTGTTTGTGTCACTTCAT 
CACCAGCAACTACCGACAGATGATGCTTTGGCCTGCTGCCTGGGTAACAGGGCGAGGCTG 
GCTCAGGACCATGTTTTCAGATCAGGGGACCTCCTTTGATGCCATGTCCATGGTGTCCGA 

16506 GCATGAGAATAAGGTTATGAAGCTGGAAGTAGCAGGCTCCCTTTTCCTC ATATGTAGGAA 

GTTAAGAATGCATTCTACGTTTCTTCTTTAAGGAGTTGGCTTCTTTCCTTTTAACATAGG 
GGTAACTGGGCCCAGGGAGTTTGGCAAGGGCCAAATAAAGTCCTTAATGCCCAGCTCAGA 
AATCTGGATTCACCATCCTTGACTGCTGGCTCCAACCCACCCTCACCTGAGCTGGTCTGC 

AGAGGATTCTTGTTTGTGTCACTTCATCACCAGCAACTACCGACAGATGATGCTTTGGCC 
CT,G] 

GCTGCCTGGGTAACAGGGCGAGGCTGGCTCAGGACCATGTTTTCAGATCAGGGGACCTCC 
TTTGATGCCATGTCCATGGTGTCCGAGGGCAGCCAGGATCAAGGGCTAGACGGGGCAGTG 
ATGAGATGAGAGCAGGAGGGGCTCAGCTGCAGCCCCAGGAGAGCCTATGCCAGCCCTGTT 
GACCAAGGAGGACAGAAGCAACAGGAGAGCGGAGGCAGAGGGGTGAGTGTCTATCGCTCA 
ATGTATAATCGGCAGACATTTGGGGAGCTCATACTGTGGGCTAAGCACAGGGAAGAAAGG 

17953 G ATTGGACGC AGTTCTGCAC AGC ACTTTTCCGAATGCCTCTG AAATGAGTCCTCACTGAC 

AGAACGGGCCCACTCTGGGGGAACTGAGGGCTCTCTTGGTCCTGCACTGCTCTTTGCCAT 
ACAGATCTGTCTGCCCAGGATTTTTCTTGGGTGTGTAGGAGGCTGAGAGAGCTCCCCTTT 
CTTCTCATGGCTAAATCCCTTGGTCTTTCCAGCCCTCCTGGGGGTTAGAAGGGAGAGGGA 
AAAAAAAAAAGACTGAACTTGTTGTTGTTGTTTTTGTTGTTGTTGTTGTTTGCCTGTTTT 

rc,A] 

T ATGT T GT CT TGTGGGG AG AGGGT AT AAG AT TG ATT G AC AG AGTGGC AC ACTTCC CC TGC 
AAATTCATCATTTGAATTTCTCAGGTAAGATGTTCACATTTCTCTGTTAAGATGCTCCAA 
TTTCTCTGGTTAAGATTTCTCTGGTAAGATGCTCATGAATTGGTGGAGGTGTTGGCGGGA 
TGTGGGAAGTGTGCCTGCTCTTTCTGAGTTTTGGGGGAAGTTGCCTTAATTCTCTGCATG 
ACTTTCTTTGCTCCTTTGGGCTTCATTTCTGTGCAATGTAGTCTGACATGAATACTGCTC 

23832 T AGGC AAC AGC ATT AT AACTCCTGCCTTCAC AAAGCTT ATCTAACACAC ACATTTCCTCC 

TCAGGCACATCCCAGCCTTCTTGCACTTAGGATTCAGCAGTATGCTTAAGGGCCATTTTC 
AACAGCAAACTCATCAGCGCAAACACAAACATGTGAAAAACGTAGCACTAAAGAGACTGC 
AAAAAGG AC ACT GGC TT AC AGC AT GG AAGCTGG AAGG AG AAGGC AG AG AAT C ACC TTGTT 

CCACTTCAGCTATGAATATGCAGTCAGGCCACCCAGTCATTCAAATTTTATAAATATACT 
[C,G] 

TAATATATATATAAATACCAGGCAGGGTTATTTTTTTCCTCAAGTCATTTTTCTAATTTT 
TTTTAAATGAATAGATAGAAGAGCTGAAGTAAGGGTCAGGAGCAAGAGCTCTGCTTCCTT 
TTCCCTTGCTGGGCTTCGTTAGAGAGCCATCATCTCCTCAATATGTCTCCCAACTCTTCT 
AGGCATTGGATGAGTTTGCTGCAGATACGAAACCCAACTTTGCCAGTCACTTCATACTAA 
CAGGTGAAATGTAGTGGAGGAGCCTTTTGAAGACAGGGACTCAGCCCCCCATTAGCCTCA 

25001 GGGC AGATG AGTTT ATACGTTTCTTTC ATGTCCCCTTCCTCCC AC AT AG ACT TTT ATTTC 

CCCAAAGGAAAACAGAAAACAATGATCTGTTTGACAGTGTTGCTATCATTGGGCATCAAA 
CCTATCATCTAAGGGGAATCCCCCTGTATAATCAGTCAGCCAAATGGAGCAGGACCCTGT 
GTTTTGTAGCTGATACAACAGGGCAGCATCTCTAGTGAGGGGGCCAGGGCTTCTATTTCC 
TTCATTAAAAAATGAAACAGCAGACCTGATTCCATATTTAGAGATTACACTTAGTTGCCA 
[C,A] 

TGTGGGTGTGCAGGCACCAACCAAACCCAGTTGGCACCGTTGTCTTTTCTCTGCAATGAT 
GTATTGAATTTAATAATGGAGGTATATGAAATTCAGAGTGATTGGAACTGAAGGTTTAGG 
GGCTTTGTGTAAAATTGATATGTAAGGGATTTGGAAGTAGGTGAGGGATTCTTCCCCAAT 
ACTTATTCAATTTTGGAGTCAAATAACCAAGCATTTACAAATAGCCAAAAAAGAAATTGA 
AAGAGGGTTTAATCCAATAAATTTTCATGCCTCATATGAACCACATCTTATAATAAGAAT 

25141 CCCCTGTATAATCAGTC AGCCAAATGGAGCAGGACCCTGTGTTTTGTAGCTGATACAAC A 

GGGCAGCATCTCTAGTGAGGGGGCCAGGGCTTCTATTTCCTTCATTAAAAAATGAAACAG 
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CAGACCTGATTCCATATTTAGAGATTACACTTAGTTGCCACTGTGGGTGTGCAGGCACCA 
ACCAAACCCAGTTGGCACCGTTGTCTTTTCTCTGCAATGATGTATTGAATTTAATAATGG 
AGGTATATGAAATTCAGAGTGATTGGAACTGAAGGTTTAGGGGCTTTGTGTAAAATTGAT 
[A,G] 

TGTAAGGGATTTGGAAGTAGGTGAGGGATTCTTCCCCAATACTTATTCAATTTTGGAGTC 
AAATAACCAAGCATTTACAAATAGCCAAAAAAGAAATTGAAAGAGGGTTTAATCCAATAA 
ATTTTCATGCCTCATATGAACCACATCTTATAATAAGAATTATGCTTTTTCATTTCATAC 
TCAGTTAACAAATATGATTTGTGAGCACCTGGTAAGTTCAGGGCACTAGGCTGAAAGGGG 
TTACGAAATGTCTTCATTTAACAAAGTCCAGCTGAGCTCTTACAGGTACCAGAACTGTGC 

25191 TGATAC AACAGGGCAGCATCTCTAGTGAGGGGGCC AGGGCTTCTATTTCCTTCATT AAAA 

AATGAAACAGCAGACCTGATTCCATATTTAGAGATTACACTTAGTTGCfcACTGTGGGTGT 
GCAGGCACCAACCAAACCCAGTTGGCACCGTTGTCTTTTCTCTGCAATGATGTATTGAAT 
TTAATAATGGAGGTATATGAAATTCAGAGTGATTGGAACTGAAGGTTTAGGGGCTTTGTG 
TAAAATTGATATGTAAGGGATTTGGAAGTAGGTGAGGGATTCTTCCCCAATACTTATTCA 
[A,GJ 

TTTTGGAGTCAAATAACCAAGCATTTACAAATAGCCAAAAAAGAAATTGAAAGAGGGTTT 
AATCCAATAAATTTTCATGCCTCATATGAACCACATCTTATAATAAGAATTATGCTTTTT 
CATTTCATACTCAGTTAACAAATATGATTTGTGAGCACCTGGTAAGTTCAGGGCACTAGG 
CTGAAAGGGGTTACCAAATGTCTTCATTTAACAAAGTCCAGCTGAGCTCTTACAGGTACC 
AGAACTGTGCCTGGGCTGTCATATGAAGATGAATGTAAGAGTGTGTCAGGCCTTCAAGAG 

2 6147 GC ATGATCTCTGCTC ATTGC AACCTCTGCCTCCAGGTTCAAGCGATTCTCCTGCCTCGGC 

CTCCTGAGTAGCTGGGATTACAGGCGTGTGCCACCATACCCAGCTGATTTTTGTATTTCT 

AGTAGAGATGGGGTTTTGCCCTGTTGGCCAAGCTGGTCTCAAACTCCTGACCTCAAGTGA . s 

TCTACTCGCCTTGGCCTTCCAAAGTGCTGGGATTACAGGCATGAGCACTGTGCCTGGCCT 

TTTTTTTTTTTTTTTTAAAAAAAAAAAAAAAAAAAACAGGAAGTTTTCGTTAGTTTTTTT 

l-,A,G] 

TTTGTTTTACTTCCCATAAAAACTCTTTGTGTCACATGGAGGTGAATGGAAAGAGAGGCT 
GTGGC AAC AG ACGGG AG AC TT TT CTG AT ATC AG AACCC AGTCCC AT AGACC AG AATGT AT 
GCTTTCAATCCACGTTGTCTGGGTCCATCCTATTGAGTGCCCTGCCCCCACAGCGGGGTA 
TGGAGAAGAGTCAGACACAGCCCCAGTCCTCACGTAGCTCACAATCCAGTGGAGGAGACG 
GACTCAGAAACAGATAGAGATGAAGCCATGAGATCAGTACTGTCCGAGGCCATGGCCACG 

27 400 TAAACTTTACAAATCCTTAATTTGTAAAATGTGGGCAATGATAGTACCTCCTCACAGGAT 
TATTACGAGGTTTACACGGAATACTCTCAGCTCATAATAAGCACTTGCACAGGCCTCATG 
GGCTAGGCCCTCAAAACTTAACGCATCTACAGGCAACAGCCATATGAAAGGAATTTTATA 
CCACCAAGTCAAAAAATCTGTGAGCACTGCTCAGAAGCAAAAGCCTGTCTCCAACAGCGC 
TCATTTAAGGGGTGGGCGAGCTACAGAGAGAAGAATGAGCCCCCACAGGGTAAGCTGGGG 
[A,G] 

AAGCTGGGGACAGAATGAGACTCAGGAAATCACTTGAATATTGATTATATTTGTGCTCAA 
TAATAAAATAACGAAATGAGTACAGCCCTAGACCTAAACATTGTGGGTGAGGCAAAGGCA 
ATGCGTTAATTTTGCATCCACTGAGGAAAAACTCTAAAACGGTGACTTCTTTTTTAAGGG 
ACCAGAAGAATCTAGATTATATTTAGTCTAAGTCAATACATACGACAGAACCTTGCCCTC 
TAGACTTGATAAGAAAGAAGTAAAATAAGAGAAAGAATAAAAAACCCTTCCACCAAAATA 

27 401 AAACTTTACAAATCCTTAATTTGTAAAATGTGGGCAATGATAGTACCTCCTCACAGGATT 
ATTACGAGGTTTACACGGAATACTCTCAGCTCATAATAAGCACTTGCACAGGCCTCATGG 
GCTAGGCCCTCAAAACTTAACGCATCTACAGGCAACAGCCATATGAAAGGAATTTTATAC 
CACCAAGTCAAAAAATCTGTGAGCACTGCTCAGAAGCAAAAGCCTGTCTCCAACAGCGCT 
CATTTAAGGGGTGGGCGAGCTACAGAGAGAAGAATGAGCCCCCACAGGGTAAGCTGGGGA 
[A,T] 

AGCTGGGGACAGAATGAGACTCAGGAAATCACTTGAATATTGATTATATTTGTGCTCAAT 
AATAAAATAACGAAATGAGTACAGCCCTAGACCTAAACATTGTGGGTGAGGCAAAGGCAA 

CCAGAAGAATCTAGATTATATTTAGTCTAAGTCAATACATACGACAGAACCTTGCCCTCT 
AGACTTGATAAGAAAGAAGTAAAATAAGAGAAAGAATAAAAAACCCTTCCACCAAAATAC 

29278 ATACACTTCAGCAAGTCACCTAACCTGCAAATTTCAAGCATGTGAATCTTGGATCTTTCA 
TGTGCTAGCTGTGAGACTTTGAGAAATGTATTTAATGTCTCTTTGCTTCCTTTTCTACCC 
AC AC AAT GGGT AT A AT AATGTCT ACC AT AT AT C TTT GC AGC AAGGTCT AAATGG GGTGAT 
ACATGCTGAATACATTTCCAACAGAGTCTGTGCAATGATAAGCTCTTTCCAAATGTTAGT 
TAAAGCTAACCAACTAACCCACCAACAAACCAACCTCTTAGCCAGGACTGATGGAAGGAG 
[C,T] 

CTGTGAGAGAATGCATTTAAAACACTTGGCACCATGCCTGACAAGAGTAAGTACTCGATA 
AATCAGTTATTGTTATTATCGCATCGGTATTATGACCATTATCCTCTTCTCTATAGGCTT 
CAGGTTTTCCTGTCTTTTTATCACAGCAGTATTCCAGCAGAAGCCTTTGATTTAACTAAG 
TCTCTACTGTGTGTGTGGCTAGATGCTATAAAGCATCCAGAGAAGTGAGAATTTGGTCCT 
GCTTTTAAGTAGCTTATAGTCTAATTAGGGGGAAGTAATCAGATAGAAAGGAAACTAACA 

314 37 ACTTGGCTTTGCCGGGGTAAGAGGGGGCACTTCTCTCCTTTCCCTCATGAAAGG AGGGAG 

AGAAGCCAAAAATCTCCCTACTAGTCAACAACTCAGGCACCCCTCCTTCTCTCCTCTATT 
TTATAGACTGGGAAGGGAGTGATGGTTGTTGGAGGTGGCAGAGCCAGTTCAGCTGCCTTT 
TGTGAAGTCCTGAAGGAGGTGTCTATCCTCAACTGCT.GGCTTCTGTCCTTAAGCCTGGGG 
AGAATTAAGTCCTCTTTGCCTCAGTTTGGCACTCCAATTGCCAACATTGGGACAGCAGGA 
(A,G) 

AAGTTCCATCCAACATCCCATTAAATATGTAATGTGTATTAGCACAGCGCCTGGCACTGG 
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GCAGGTATTTTCTAAGTGATAGCCAATGCGAAGCCTACTTTATTATTTTCCTCTTTGCTT 
AACCTACAAGGTGTCTAAGACCATTTGTTTGTCCACACATAGTAAGATAAACAGCACTGA 
GACTGTGGTCCTTTCTGCCCTGTGTCCTTATCCCACCTGGGAATCTGGAAAGCCAAGCCT 
AGACACACTCGTTCCACAAATGTTTACTGAAGCTTGTTCTATTCAAAGCACTGTACAGCT 

31857 TAACCTACAAGGTGTCTAAGACCATTTGTTTGTCCACACATAGTAAGATAAACAGCACTG 

AGACTGTGGTCCTTTCTGCCCTGTGTCCTTATCCCACCTGGGAATCTGGAAAGCCAAGCC 

TAGACACACTCGTTCCACAAATGTTTACTGAAGCTTGTTCTATTCAAAGCACTGTACAGC 

T AC AAAG ACC ATCTTTTC TG AACTCC AAACC AGGCC AC ATGGTTGG AAT AAC TT C AAGT A 

TGGAGACCAAGAGAAAAGGTGGTTGTTGTCAGCAAAGCTCTGAGTCCACACCTTCCAGGA 
( A, G] 

CTTATAGTTGATGCAATGGTGGGAGAAGTCTGAACCTGGATTCAATCTGCTTGATTCCGA 
TGAATGGTGCAGTAGGCAGAGCCATGAGTTCAGAGCAGGAAGAAACCACTGGTTCAAAGA 
AGCATCTGTCACATCGAAGCTGCTTTATAGTCTGTTGGGAAGCATGCATAATAATTTATT 
CTTTCTTTCTTTCCTTTGGTCAACAAAGATTTCTTGAGTCCCTACTATGTGCCAGGTACT 
CTTCTAGGTACTGAAGATGCAGCAGTGAACAAAGAAGATACAATCCCTGCCCAGCGGAGC 

3 3 1 b 5 ACAGTGCTGAATTTTCACAAATTGCGAATTAGGAAATTGTTGCTCATTTTAC AATTTGGT 

TTCCCTCAGGATTCCTTTTAAGTAGCCAGCTACCCCAGTACTTTTGAAATATGACTTGCT 

TATAAAAATTTGATAGGCTTGGCACGGTGGCTCACACCTGTAATCCCAGCACTTTGGGAG 

GCCGATGTGGGGTGGATCACGAGGTCAGGAGTTCAAGACCAACATGGTGAAACCCTGTCC 

CTACTAAAAATACAAAAACTAGCCAGGCATGGTGGCACATGCCTGTAATTCCAGCTGCTC 
|G,A] 

GGAGGCCAGGCAGCTAGGCAGGAGAATCACTTGAACCCAGGAGATGGAGGTTGCAGTGAG 
CCAAGATCATGCCACTGCACTCCATCCTGGGTGACAGAGCAAGACTTCATCTCAAAAAAA 
AAAAAAAGATATATAAACAAGTTTTTATAATATTCTCAATATGAACTAGTAGAAAAAAAG 
CATCTGTTTTTAGGTCTTAGAGGCCTGGTTCCCAGTTTTATCTCTGACTCTAATGAGGTA 
TAGTATTACCTACATTGATTAGCCCTTCTATACTTCATAGGAGATGCTCCAAGACTGCTA 

39487 CACTTTGCTCCATCCCTTGGCCTTCTGCAGTCCAAGCTCCATTCTGAGATCATCCAAGGC 

TTCTCTTCTGTGTTGATCCTTGGCCTTCTTGGAGTCTCTTTCTCCCATGTTCTCCACAAC 

AGAGCATTCTCCTGACTGTTTTCATTCTGCATCTCACTCTTTCATCAGTATCTTTTTCTC 

T ACC ATGC CCC A TAAATT TGGGTGC TCCTGAGGGTCCTGTCCTTGTCCCCTGCTTTCTTG 

TTGTACAACCTCCTTGATCTACTTCATCTACTCAAGTTTGGTCCACAATTTCTATATTGT 
[G,C) 

AAGATTCAAATCTGCATCTCTAGCCATATATCCATTTGCCTGCTAGGCATTTCTACCTGA 
ATATTTTATAGGCATGCCAGTGGCTCTTACTCTATGGCTCTTACTCTAAGTCTAGACTAC 
AGCAGAAAGCAATGCTCTTTTTATTAAGGCATAGTGCCTCTTTCAGAATAATTTACAGCA 
TACAACCAGGCCTGCTGTGCAGCATTACAATTTGTCATTAAAACTCCATTCCTCTTGCCA 
GAGTAAATGAGCCATTTACAGCCAGGGCGCCAAGATGGACTGTTGTTATTTTTTCTGCCT 

41449 TCAGATTCC AGGAC ACCAAGTTTTCTGTGGGAGCTTCCCTAGGAATATAACTAAGGAATT 

TAAATCAGGTTCAGCTCATGCTGTTACACTCTCTTCCTCCACTCAGGCATTGGGTGTGGC 

TTTTCCAAGCTTGAGAAGGGTGTGATCTGAGATGGGCTTGGGTATAGAGGGGAATTATAT 

TTAGGTCTACCCTGTATAGGAAAAAGTGCCTTCCCAAAGTCTCCCTGGCCTAAAGTATAA 

GAGATATGTGTTGGGATTTAGACCCAGAGCCCAAGCCAATAATGGGACCCCCTTCTCACA 
(T,C] 

GTGGCTACCTCCTGCTATCACCACAACAGCTATCATACCCATAACTACAACAGAGGCCAA 
TTAACGTGGTGATAATTGACAAATGTCAAGACATCCTACATTGAGGCACACTGTGCGTTT 
TGCGTGAGCTTTTAAATTGGTAGGGAAGGAAAACTTTTATACCTACACCTATCATGGAAG 
GCAGAAGGTAAGAGCTAAAATAAAGGTATGCCAAGAACAAAGGCAGGAAAGAAGGGTTTT 
AACAACTTGAGGCCTGATCCATTGATTAGTGAAGAGGAAACATGTTCAAAAACCACTCTA 

42420 GGGTGAC ATGATAGATCTGTATTCT AGAAAAGTTAGTTTTGCAGCAGTTGTGTCCATTGA 

AAGGGACAGGATAAGGGAGATAGATAAGAAGACATGCTATGATGATAACTAGATTTGGAT 

ACCAAGTGGTATGGTGGAAAGGAATGAGAGAACAGGGTCACAGATGAATGACTGCCCAAT 

TTCAATCCATCATAACAGGATGTATAGGATTGCCCTTAAGTAAGATGGGGAATCCAAAAA 

CGAGGAACAAGTTTGTAAGGTTTTGGGGGCCAATGATGAATTCCATTTGGGACATGTTGC 
tT,C) 

TTGG AT AT AC C AAT GGG AC ATTC AT GTG AAAATG ATCTCGGC AATCC T ATCCTGGAAT T C 
AGGATAGGATCAGAATGAGGGACACAGTTTATAAGGTAAACAGAATGGAGGTGATATAGA 
AGATAAGGGCATAGATGAGCTTACCAAAGGGGAGAGTTTAGAATGAAAAGAAAAGACCAA 
AGGCTAAGCCTGTGCTATTCTTTCTCCTCACAATACGCTTCAGACCTGGGCACAAACCAT 
C AGTG AGT GT C A T G AT AAC ACT ACTGTGGGC AAATCCCCCCTCT AT AAGGGCCTG ATTTC 

43256 AG AAAAAC AATT AGAATGG AG AGCTAACTCTTTGGAAATGGTC AAAGAAC ACGGGTCTAC 

AAAACCGTCAATAAAGCGCTAAGATGCCTGGGCGGGGTCAAAAAGTCTACCTGGGCGGGG 

TCAAAAAGTCTACCTGCTCAGCATATGGGGCCCAGACATCTGACCTTTACCAACTCCACA 

ATAACCACTTCATCTATGGATCCAGTCTTGGTATCACCTAGTCGCTGTTTTCAAGTAACA 

GAATATTTGGTTCTCAATGGTAGGTGACTGGAATACAGCTTACTTTCTCCCACCCCTACC 
[G,C} 

CCAATCCTTTCTGCCCCCTTATAGTTTAATTTGCTTGTAAATTACTTGGGAATACATTTG 
GGAGCCATTATAGGGAAATAGAAGGCAGACATGATGAACAGAATGCAGGGTGTTTTTTAT 
TACTTCACATTGTGCTCAACAATTAGGAGGAATTCTAGAAGCCCCTCCCAGTGGCCAGGA 
ATTGGTCATAGCATGAATAAACTCAATATAGGTTGAGTATTCCTTACCCAAAATGCTTGA 
TACCAGAAGTGTTTTTGGATTTTGGATTTTTTTTTTGAATATTTGCATTATATACTTACC 
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GGGTTTTGGGATTAGGGATACTCAACCAGTGGTAGGTTTGGGATGATATCAGCATGCTAA 
GGTC AAAG AG ACCT AGCTGGG AAGGGTGGG AGGAAC ATGG AATT TTC ATTCT CTGGGC AC 
CCCTTGAACAGTCTTACTATTAGGGCCCCAAATTTGTTCTAAGTGTGTGTGTGTGTGTGT 
GTGTGTGTGTGAGAGAGAGAGAGAGAGAGAGAGAGAATTTTCTTTCTTCCTTTATATTCT 
AAGTTCCTCAGGACAAAATTTTGGGTTTCTTTGTATTCTCCCTGCAGCTCCTCATGTAGT 
[T,C] 

CT A AGC AAAT A AAGG AAT TC ATT AGGTCCTTG ATTTC AG AAGCC TC C C AGTT CT CT ATGT 
AGGAGGAATCTTAGGGTGGCAAGATAAGTTGAGGGACTTTTCTTCAAGCACATTTCACAA 
GTAAGAGAAAATGTTGACTGTGTATATCTAAGAATGGGTGGGGCTCAATGATGCCCCCCT 
AAGTTACTCTTTACTATTATTGATTGATTGATTGATTGATTGAAGAAGCAATGTTTTGAT 
TGATTGAAGAAGTAATGTTTCCAATGGCTACAGCAGACTGGAGCAAAAGAACAAAATGAA 

TTTTGGCTCTATCCTGGCTTCTTCACACAGGGGTGTCCAGTCATCTCATCCTGGTGGGAC 
AGGGATAGAGCTGTGGCAGTGGAGATGAGGAAGCTCGCCTCCTAAGTGAGTCTGAATTCT 
TAAATATGGAGCCACTCCATAATCATTTGGAGTGAATATTGGGCCATGGCCCTTTTTCTT 
GCCAGCTGAGCTATGAAAAAAGGATGTCCTAAGACCAGAGGCTGTGGGACCATTCCCAGC 
C C CT GC AGG AATC AAAGGAGCTG AC AG AATTGTTTGTTTGTTT TTTTCAC AAAT TG AAAA 
C",A] 

AAAAATGTAAAATTTTTGAAAAGAAAGCCTCATTGAAAAGAAATCCCTCTCCCCAGCTGG 
GCTCCCAGGCAGCCTCCTGCAGAACATCCTTAGCATTGCAGAGTTGTTCCCATGGCAACC 
G AGT AAG G GGC TTTT TGTTTT CCTT AG AAG ATTGAATCCT TTC AACC AG AAGGT AACC AC 
T GGTTCTT CC C C AC AAT CC AC ACTC C AAACCCCCT ACCCTT ATT TG AC T AC AT G ACT AGT 
TTTGCATTTATGGATTTTTTTATGCCTAATTGAAAAAGGCTAAATATACAGAAACTGAGG 

TGAGGGGTTATGAGACCATAGGCTCATTTTGGGGGGGGTCTAAAATGCAGTATTTTTTGA . ^ 

ACTGATATGGGGAAAAAAAGACATTTCTGAATTGTTGTCATGTTGCAGATTCTGGGCCGT 
TCCAGCATAAGCACCTTTCTTAGAGTACTTGGCTTTGTGAAGTAGTCCTTATCCCCTCCT 
TCCACTATTTTACATCAAGTTAAAATAGAGGAAGATGCCTAGAAATGGCCGTATAGACAG 
AGAAAACTGCACTAAAACTCCCTCCGTCATGCCTGACTCCTCTCTAGACTATGACCATCG 
[A,T1 

GGGGCCAGAAATCATATCTTAAAGATCACTGTGCCTCCAGTACCCAGCACGGTGTTTAAT 
AAATGTTTGTTGAATGAACGAACTAGTAAAATTTTCAAATCATTAGAGCTGAAGTATCCT 
TTAAGATTCTTTAGTCCCTCATTTTACAGATAAGGAAGCTAAGGCTCAAGACATTGTGTG 
GCTTGGCCAAAGGCACACAGCAAGCTAAAGGCAGAGGGAGGACAGGACCCGGCTGTCTCA 
ACCCCCTGGCTGCTACACTTCCTGCAGCATTTCTAATTCTTTTACCATTCTTGCGAGGGA 

CCAATGGGGAAGCACCAGGGTCAGCCGCAAGGCAGAAGGAGCAAGAGGAAAACATGGACA 
AGAGGCTCTACTGTGGATTCAGTGGCAAAGAATGGGAGGGGCAGAGTAAGCAGGTTTAGG 
ATTATCGGGTTTGAATGACTTGATTGAGCTGTAGGGTGTAGAGACTGCCTCTACTGTCTG 
GCACCAGGGGTAATTAGGGCAGCTGGATAGTGGTCTGGAGTGTGAGAGCTCCCTAAAGGA 
GGTGGTTGGAGGTGTAGGTTTTGGATTGGTTGATCTGTATATGAAAGGTGCACGTGCAGG 
[O, T] 

TGAGTCCTCTACTATCACTAGAAATTGGCTGGTCCCAGGAGAAGTAGTCTCTCTAGAGAC 
AGCAATGCCCCAGATGTCAAAGCATCAGAAAATACAGAAAAAAAATTAAAAGCATGATTA 
ATTCATACTCACAGGTCTAGTTTTTGTGTAGTTAAGAGCAACCTAAAGAAGTTGATAACT 
CGTGTTGCAGGTCAGGTTTCCCAGAAATCATATTCTCAGATGAAGATTTGCATGAAGGAG 
GTTTAATGCTCAAACTAAGCCCTAAGGCTCCATACCTGTGGAGGAAGTGAAAGAAGCCCA 

TAGTGAGGCACACTTACTTCTTAATTTGTGCCACCCACTTTTCAGGCTCCCTTAGGACAG 
CCTCCACCTGCTCCTACTGTGCTTCCCATCGTCCCTCTCCTCAGGCACAGGCTGAGGAGT 
AATAAGAGCACCTGATATGTGTCAGGCCTTACTGTGTGCTAGGAATTGTGCTAAGTACTT 
CCTATGAATTTTCCATTTATTCTTTATAATAACTTTGTAAAGTTAGAGCCATTATTCCAG 
AAG GG AAA ACCGAGGCAATGGGAGTCAAAGC AAAG AATTTGGGCTTTT AACC ATT AC ACT 
t A, G] 

TT TT GC AC A AG T AGCC AGT AATG AAAAGGCTGCT ATCCGG AAT C AT CTTT GC AAAAGGT A 
ATTT C T T T AGC AC TT T ATC AG AAG AAGGGGGCTCCTT CCTC AAAT TCT G AGGG AAG AGAA 
GTGGGGAAGAAAAGATGACTGAATCCAAAGCTCGGGCAGGGAAAGCACATCGAGTGCCAA 
GTGCGCTGCGCTGGGGTCTAGTCCTGACTCAGCCGCCATCTTCCCAAGTGCTTCCTGGAA 
TTCTCTCCTCTCGTGGGGCCTCAGCTCCTTCATCTTAGGAAAGAAGGGTAAAGATCTACA 

CACTTTTCAGGCTCCCTTAGGACAGCCTCCACCTGCTCCTACTGTGCTTCCCATCGTCCC 
TCTCCTCAGGCACAGGCTGAGGAGTAATAAGAGCACCTGATATGTGTCAGGCCTTACTGT 
GTGCTAGGAATTGTGCTAAGTACTTCCTATGAATTTTCCATTTATTCTTTATAATAACTT 
T GT AAAGT T AG AGC C ATTATTCC AG AAGGGAAAACCGAGGC AAT GGGAGTC AAAGC AAAG 
AATTTGGGCTTTT AACC ATT AC AC TATTTTGCACAAGT AGCC AGT AATG AAAAGGCTGCT 
(A,CJ 

TCCGGAATCATCTTTGCAAAAGGTAATTTCTTTAGCACTTTATCAGAAGAAGGGGGCTCC 
TTCCTCAAATTCTGAGGGAAGAGAAGTGGGGAAGAAAAGATGACTGAATCCAAAGCTCGG 
GCAGGGAAAGCACATCGAGTGCCAAGTGCGCTGCGCTGGGGTCTAGTCCTGACTCAGCCG 
CCATCTTCCCAAGTGCTTCCTGGAATTCTCTCCTCTCGTGGGGCCTCAGCTCCTTCATCT 
T AGG AAAG AAGGGT AAAG ATCT AC AG AC AAATTGATCTTT AAGT ATCCTT AG AG C AC T AC 

TGAAATACTTTAAAACTTGTAGCTTCCTTCAGCACAGAAGTGGCTCTCTGAACCAATTTT 
AAGCAATCCTGGCTCTATCTGTGCATGTTGATTTAGCCTGTGGTTATAGTGTTAACAATT 
TAGTGATTCACCTCATTTTTAATCTCTCTTTCCCTTTAGCAGGATCATTTTCTCTGTGTT 
AAGGG ATC AAC ATTG AGGT AAG AATGGCT AAAT AAT AGC ATCT TCTGG AAT AC AAATG AC 
TTTATAAATAAAAGAAGATAAAAGGAAGAAGTAGGATGATTTCTCAGCTCTAATACACTT 
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[A,G] 

GCAAATGCCATATGCTTTCTCCTGCGTGTACTGGTCAGGCCAGTTCTAGATACAATCATG 
CGCTGCATAATGATGTTTTGGTCAACAGTGGATTGCATATGTGACGGTAGTCCTTTAAGA 
TTATAATACCATATTTTTGCTGTGCCTTTTCTAGGTCTAGATATGTTTAGATACACACAT 
ACTTACCATTGTGTTCCAATTGCCTACAGTTTCCAGTACAGTAACCTGTTGTACAGGTTT 
GTAACCTAGGAGCAATAGGCTATACCATACAGCCTAGGTGTGTAGTAGGCTATACCACTT 

ACTGTCCTTCCTGTGTCTGAGGGAAGGCATGTAACTCTTGCTTATCTTCACCTGTGCTCT 

AGATCCTGACCTTCTCTGGCAACCTCAGGGACCTTGCACCATCCATTCTTCTCGCCTAAT 

GGCGAGACTCAGTCTCTCCCTCTCCCTTTCCACTCTCCCTTGCCATTCTTAGTATCTTTC 

TACAAGCAGGTCTTCCAAAGTACTGCTTGAGGTCTGAGTTGGAGGGAACATGCCTCTACC 

CTACTAAAAAGAGAAATTCCTCTGCAGAAGACCCAAGCTGACTGACAAATCCCTTTACTG 
[C,TJ 

AACTGCAGCTCTAGCTCCCACCATTTTCCTGTACTTACTCTCCTGCTCAGGTTCCCTGGC 
ATTGCTGATGTCTTTCAGCCTTTGTGCCCTGGCCCCTTTCCTCCTCTCCCCTCATCTAGC 
ACTACCTGTCAAAATCAGGGACTTACTTTAAAATTTATCCCAAATTATCATTGCCATCAT 
CTCCACTGTCACCTTATCATATGTTTGAATAGCGTTTCCATTTCCCAAATGTTTTCGCAT 
GCACTTTCTCAATTGAGCCTTACGAATCCTAGAGCTGAGAAGGGTAACAATTTATGAGTC 

AT AC A AAAATT AGC C AGTTGTGGTGGC AT GC ACTGT AGTC TC AGCTCC TTGGG A GGCTG A 

GGCAGGAGAATTGCTTGAACATAGGAGGTGGAGGTTGCAGTGAACTGAGATTACGCCACT 

GCACTCCAGACTGGGAAACAGAGTGAGACTCTGTTTTATATATATATATATATACACACA 

CGTACATATACATGTATATATATACACATTATTATTGAAAGCAGCCAAAGAAAAATAACA 

CATTATATATAGAGAAAGAGCAAATGATGAGTGACTTTATATGTATATATATGTGTGTGT 
l G, A] 

TATATATATAATGTGTATATATATACATATATATATATAGGTTAAGAACCTTCAGCACAT 
GTATACCTATGTAACAAACCTGCATGTTCAGCACATGTATCCCAGAACTTAAAGTGAAAA 
AAAAAAAAAAGAACCTTCTGCATGCCAGTAACTGTGCTAAGTGATTAGGATGCAATGGTA 
ATAAAAACAAAGTCCCTCTCCTTAAAGAATTTTCTATTTAGAAGGGAAAACTGGTAAATA 
AAAAATAAATATATAAATTACAATTTGTGAAAAGTGCTACACATGAAAGAGTGCTGAGAC 

TGTATATATATACACATTATTATTGAAAGCAGCCAAAGAAAAATAACACATTATATATAG 

AGAAAGAGCAAATGATGAGTGACTTTATATGTATATATATGTGTGTGTGTATATATATAA 

T GTG T AT AT AT AT AC AT AT AT AT AT AT AGGTT AAG AACCT TC AGC AC ATG T AT ACCT ATG 

TAACAAACCTGCATGTTCAGCACATGTATCCCAGAACTTAAAGTGAAAAAAAAAAAAAAG 

AACCTTCTGCATGCCAGTAACTGTGCTAAGTGATTAGGATGCAATGGTAATAAAAACAAA 
[G,A] 

TCCCTCTCCTTAAAGAATTTTCTATTTAGAAGGGAAAACTGGTAAATAAAAAATAAATAT 
AT AAATT AC AAT TTGTG AAAAGTGCT AC AC ATG AAAG AGTGC TG AG AC AG AC ATC AATG G 
ATAAACTTTAGATTGAGAAGGGCTCTGACAAAGCAACATTTAAGGTGCAACCTGAGAGAA 
TAGAAGTTAAACAGGCAGATATTGGTGAAAGAGCAGTCTAGGCAGAGGGAACATCATTTG 
CAAAGGCCCAGGGTAAAGAAGATCCTGGTAAGGAAATGACAGTGGAAGAAGGTTAGTGTA 

TATATATATACATATATATATATAGGTTAAGAACCTTCAGCACATGTATACCTATGTAAC 

AAACCTGCATGTTCAGCACATGTATCCCAGAACTTAAAGTGAAAAAAAAAAAAAAGAACC 

TTCTGCATGCCAGTAACTGTGCTAAGTGATTAGGATGCAATGGTAATAAAAACAAAGTCC 

CTCTCCTTAAAGAATTTTCTATTTAGAAGGGAAAACTGGTAAATAAAAAATAAATATATA 

AATTACAATTTGTGAAAAGTGCTACACATGAAAGAGTGCTGAGACAGACATCAATGGATA 
(A,C) 

AC TTT AG ATTG AG AAGGGCTCTG AC AAAG CAAC ATT T AAGGT GC AACC TG A GAG AAT AG A 
AGTTAAACAGGCAGATATTGGTGAAAGAGCAGTCTAGGCAGAGGGAACATCATTTGCAAA 
GGCCCAGGGTAAAGAAGATCCTGGTAAGGAAATGACAGTGGAAGAAGGTTAGTGTAGCAG 
GACTGTGGCTAGGGCGGAGAGGCAGGGAAGTAGTTTAGAATTTCAATGCAATAGGAAATA 
TGGAAGATTGAAGGCAGTTTTGCATTATAAAATAATATGATTGCTATTTTAAAGCTACTT 

CCAGGCTGTACCAGGCACTCAGATATGACAGTGAATGAGATAGGCAACATCTTTGCCATT 

GGAGAGCCTACACTGAAGTGGACATGAGGGAGTTGAAAGCAACTCTTATAGGAAATCATG 

GTAAAGACGTCCAAGAGAAGAAAGATGAAGGGCAAACACATGCACGGATGCCAAACATCT 

ATCAGAGAGAAAGGAATTTTCAGACCTGACCTGAATGATGAAAGGAGGTTTTTGGAAAGG 

AAAATAGAAGGGAAGGACAAGGGAAATTATCTGGGCAGCAATATTTATCTGCTGTGGTGC 
(T,C] 

TCACTCTCTCTCTAATCCTTTTCCACCCCAGCCCCAAATTTGAAAGGATTGCAGGGAGCT 
CCTGCTGGAGTCATTTCTGGTATTAAAAATGTACAGAAAGGAAAGCTTTGGTTCTGAGTT 
TGCAGGCTTCCCTGTCTTTCATTCCTATTGTAGAAAGCAGCTTATATAAAAAGATGTGCT 
GTGTGGCCCTTTGAGCTGCTGTGATTGTGTTAGGACCCCACTGGATGGTATTCGCATGAA 
TT AAT CT ACTGT AGC ATCTCT AC AAATCAAGAGGCTGGCTTCTGTTTGAAATGTCCC AAG 

ATTAAAAATGTACAGAAAGGAAAGCTTTGGTTCTGAGTTTGCAGGCTTCCCTGTCTTTCA 

TTCCTATTGTAGAAAGCAGCTTATATAAAAAGATGTGCTGTGTGGCCCTTTGAGCTGCTG 

TGATTG TGTTAGGACCCCACTGGATGGT ATTCGC AT GAATTAATCT ACTGT AGC ATCTCT 

AC AAATC AAG AGGC TGGCTTCTGTTTGAAATGTC C C AAGGCT TTGTGC AC AGGGC AAGCT 

AAATGTCTCCCTACAGTGAGACTGAAAATGCCTTGGGTGCCCTTGTCGATAGGATCTGAT 
(A,G] 

TATAGATGCATGTCTACAATTGCACAGTGGCTGCTGGCAACATTTATTACAATCTGAATG 
TGAAATGGCTATTCTGTTCAAGGATTCTGATAAAAAGTATCAGCCACAGTAGATGTATAA 
GGAGCCTGGTTTCACTGCAACTGACTACAGTTATCTGATTTTTTTTTTCTAGTTCATTTT 
TAGTCTGTGGAGCAAACAGAGATTTCCTCCCCAAATGATGTCCTTTCTCAGTCACCAGGG 
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TGTGGTTATTTGGTTTTATGTAGAGGAGATAGAAACCAATCAGTCTAAATCATATTCTGT 



AAAGATGTGCTGTGTGGCCCTTTGAGCTGCTGTGATTGTGTTAGGACCCCACTGGATGGT 
ATTCGCATGAATTAATCTACTGTAGCATCTCTACAAATCAAGAGGCTGGCTTCTGTTTGA 
AATGTCCCAAGGCTTTGTGCACAGGGCAAGCTAAATGTCTCCCTACAGTGAGACTGAAAA 
TGCCTTGGGTGCCCTTGTCGATAGGATCTGATATATAGATGCATGTCTACAATTGCACAG 
[T,C) 

GGCTGCTGGCAACATTTATTACAATCTGAATGTGAAATGGCTATTCTGTTCAAGGATTCT 
GATAAAAAGTATCAGCCACAGTAGATGTATAAGGAGCCTGGTTTCACTGCAACTGACTAC 
AGTTATCTGATTTTTTTTTTCTAGTTCATTTTTAGTCTGTGGAGCAAACAGAGATTTCCT 
CCCCAAATGATGTCCTTTCTCAGTCACCAGGGTGTGGTTATTTGGTTTTATGTAGAGGAG 
ATAGAAACCAATCAGTCTAAATCATATTCTGTTGAAATCAGAACCAAAGGATCCACAATC 

GGTTTAAGAAAGCAAGGTCAGGTAAGCTTCACAAAGTAAGTCAAGAAGTATTTTCACCTT 
TATCTTCTGAAAGAATTTATGCAACGTTGAAATTATTTGTTTCAGAGATGGTCAACAGAA 
TATACCAGAGAAACTATTTGGACTTAGAGCTTCCTTGGGGGAAGGTTTTTGATAAATAAT 
GCAATTTCTTTAATACATAGTACTTATATTTTCTATCTTACCTTGTGACAATTCTGATGA 
ATTGTGTTTTTCAAGAAGTTTGCCCATGTCATCTGAGTTGTTAAACTTACTACAACAAAG 
[T,GJ 

CTTTGATAATATTCCTATATTAGCCTTTGAATGTCTATAAGATCTGTCCTGATGTTCCCT 
CTCTCACTTTTTTAAAGAAGTCTTGCTAGAGGTTTACCAATTTTATTTTGTTTTATTTTA 
TTTTATTTTTTCTTATTTGAGACAGAGTCTCGCTTTGTCGCCCAGGTTGGAGTGCAGTGG 
CTCGATCTCGGCTCACTGCAAGCTCTGCCTCCCAGGTTCACGCCATTCTCCTGCCTCAGC 
CTCCCGAGCAGCTGGGACTACAGGCACCAGCCACCATGCCCGGCTAATTTTTTGTATTTT 



GTCTTTTCATTGTCTTTTATTCAATATATTCTAATTTTTCTTGTGATTTCTTCTTTGGCC 
C AT AGGCT GTTT AG AAAT ATGT AGTT AGTT TCC AAAT ATTCGAAG AC TTTC AC AG AT ACC 
TTACTATTATTGATTTCTAATTTAATTCTGCTACAATCCAAGTATATACATTATAAAGTT 
TCAGCCTTTTGAAATGTATTAAGAATATTACCAGAGATAAGAAGATAAGAATATTACCAG 
[C,A] 

GATAAGTAGGGATATTTCATAAATAATAGACGAATTGATTCATCAAGAATATACAACAAT 
CATAAATGTGTATGTGTCTAATAACAGAGTCTCAAATTATATGAAACAAAACTGACAGAA 
CTAAAGAGAGAAATGGCCAATCCCACAATCTTTATCTTTATCAGGTGATTTATCTTGGTG 
AACATTCCTTGTGCTCTTGAAAAGAAAGTGTATTCTGTAGTCATTGGGTATAAAATTCTA 
TATATGACAATGAGGTGATTGATAAAATTATTTAGATTGTCTATATCCTAAGTTTTGTAG 

TCCGTTATTATTTATGCAATGTCCCTATTTATCTCTGGTCATATTCTTTATCTTGAAGTC 
TTTTTAACTGATATGAATGTAGCCACTTCATCCTTTTTATGCTTACCATTTGCATAGTTT 
ATATTTTTCCATTATCTTATATTCACACTATTTATCCCTTTATACTTAAGTCCATGTCTT 
GTAGACAGTATGCAGTTAATTGTGTCTTGATTATTTTTACTCCTTTCTGACAATTTCTGC 
CTTTCCATATAATATGCTTATCAATACAGTTGGAGTTAAATCTACCGTCTTGTTATTTGT 
[C,G] 

ACATCTCCCATCTTTTGTTGTTGTTCCTCATTTCCTTGTTTATTACCTTCTTTTCAGTTA 
TTTTTTTTTTGTATTCCATTTTAATTCCTCAATTGGCTTTATAGCTATATATCTTTGTAT 
TATTTTTTATTGTTTGCTCTAGGGATAGCAATATGTATACTTACCACAGACAATTTAGAA 
ATCATATTGTACCACTTCACATAAAATAGAAGAAGCTTGCAGCAGTCTATGTCCCTTTAC 
ACTCCCATTCTTTGTGCTATTGTTTCCGTATGTATTACATCACGTACATTGTAAAATCCA 

TTTTAACTGATATGAATGTAGCCACTTCATCCTTTTTATGCTTACCATTTGCATAGTTTA 
TATTTTTCCATTATCTTATATTCACACTATTTATCCCTTTATACTTAAGTCCATGTCTTG 
TAGACAGTATGCAGTTAATTGTGTCTTGATTATTTTTACTCCTTTCTGACAATTTCTGCC 
T TTC C AT AT AAT ATGCT T AT C AATAC AGTTGG AGTT AAATC T ACCGTCT T GT T ATTT GTC 
ACATCTCCCATCTTTTGTTGTTGTTCCTCATTTCCTTGTTTATTACCTTCTTTTCAGTTA 
[T,AJ 

TTTTTTTTTGTATTCCATTTTAATTCCTCAATTGGCTTTATAGCTATATATCTTTGTATT 
ATTTTTTATTGTTTGCTCTAGGGATAGCAATATGTATACTTACCACAGACAATTTAGAAA 
TCATATTGTACCACTTCACATAAAATAGAAGAAGCTTGCAGCAGTCTATGTCCCTTTACA 
CTCCCATTCTTTGTGCTATTGTTTCCGTATGTATTACATCACGTACATTGTAAAATCCAC 
AATAGAGTGTTATAATCTTTTTCCAAATCCTTGTGTGAATTAAAAATTTTATGAGTAGAA 

CAGTATGCAGTTAATTGTGTCTTGATTATTTTTACTCCTTTCTGACAATTTCTGCCTTTC 
CATATAATATGCTTATCAATACAGTTGGAGTTAAATCTACCGTCTTGTTATTTGTCACAT 
CTCCCATCTTTTGTTGTTGTTCCTCATTTCCTTGTTTATTACCTTCTTTTCAGTTATTTT 
TTTTTTGTATTCCATTTTAATTCCTCAATTGGCTTTATAGCTATATATCTTTGTATTATT 
TTTTATTGTTTGCTCTAGGGATAGCAATATGTATACTTACCACAGACAATTTAGAAATCA 
[T,C] 

ATTGTACCACTTCACATAAAATAGAAGAAGCTTGCAGCAGTCTATGTCCCTTTACACTCC 
CATTCTTTGTGCTATTGTTTCCGTATGTATTACATCACGTACATTGTAAAATCCACAATA 
GAGTGTTATAATCTTTTTCCAAATCCTTGTGTGAATTAAAAATTTTATGAGTAGAAAAAT 
ACATATAACATTTTATTCTTACCTACATACTTACCAGTTCTGCTTTCTTTTCATTCTTAC 
CTGTTTCAGTCTTATCTGTAAACCCGTTTTCATTTGGTGTCATTTCCATTAGCATTTCAG 

GGCATGTCACTTCCTGCCCCAAACCCCTTGGTAGCTTTCCATTGCTCTTAGAATAACTTT 
GT G ATCT AC AAC ATCTTC TT C AAGGCCCC GC AT G AT AC AAATT C TGGC T ATT TCTCT AGT 





-COCID: <WO 0233086A2_1_> 



ACTTCTACCACCAGGTTTTTTCACACGTTCTTCTTTCCCCATTAACAATGATCCACCATT 

CTCTTTCTTTATCCACTGTTACTCATCCTCATAACTGAAACATCATTTCCTAAGGATGGC 
(C,T) 

ATTCCTGGTTCAGTCAGTCTATATTTCATCCCCCATCACATACTCTTGTTTTACCCTATA 
TTTTTCCTTCAAAGCACTTATTTAAGTTGTAATTATGTGTTGTTTATTTTATGTCTGTCT 
GCCC T C AC AG AATCC AC AGTCC AGG AG AAC AG AAATCCTGCCTCTTTT ATTT AT ACC AC A 
TCCACAGTATTATTAGTGCCTGTCACCTAGTAGGTATGCAGTATGTACCTATTGAATAAA 
TGAATTGACTTCTGTCTTTTAGATCGTCTACTCATTTTATCATTGATGACAAACATAATA 

TATTGCACCACCTTGTCCCTCATCCACCTTTTTTTTAGTCTTCTCTCTTTCTTTGAACTT 
CTACCACCAGGTTTTTTCACACGTTCTTCTTTCCCCATTAACAATGATCCACCATTCTCT 
TTCTTTATCCACTGTTACTCATCCTCATAACTGAAACATCATTTCCTAAGGATGGCCATT 
CCTGGTTCAGTCAGTCTATATTTCATCCCCCATCACATACTCTTGTTTTACCCTATATTT 
TTCCTTCAAAGCACTTATTTAAGTTGTAATTATGTGTTGTTTATTTTATGTCTGTCTGCC 

TCACAGAATCCACAGTCCAGGAGAACAGAAATCCTGCCTCTTTTATTTATACCACATCCA 
CAGTATTATTAGTGCCTGTCACCTAGTAGGTATGCAGTATGTACCTATTGAATAAATGAA 
TTGACTTCTGTCTTTTAGATCGTCTACTCATTTTATCATTGATGACAAACATAATACCTT 
ACATTCGTGTAGTCTTTTTCACTCCTCAAAGAGGATTTTCTGCATAGCTCCTCTGAGCCT 
C AC AAA AC C C T T T A AGG AAG ATTGT GAAT ATT AT C AGAT AAAG AT TGTG AG AC AC AGAAA 

TCCAGTCATTTATAAAAGATGAAGAGGAGAACAAGGTAGGCCAAAGTGGCTTTGTACTAT 

TAAAGGCTGCTTGATTTCTAAGTACATGTTCTTTGCCACCTTTCTGCCATTCCACATTCT 

AGAAGCCATGGGTAAGTCAGCACAGGGATCTTAACATGATAACATTGGTTTTAGGAGGTC 

TCGTGCATAATGGACCAGACTTAGAGCACAATGCTGTAAGGTAGTGATTTAGGTGAGCAG 

CAGATTCTGGCTTTAGGAGTTTATTATCAGATGCTTTTTAAACGACTTGTGGCCCAGGAT 
(C,T1 

CCTGCACCCATGGGAAGCATTGTAGCCTTAGAACTCTGGGAATTCTGAATATAATTCCTG 
AATCAATCGTAAGGATGCATATCTGATGCTTAGTGCAAACCAAGAGGCAGAATATTTGCA 
GGCAGTGTATCCTTGAAAAACAAATCTAGGTCATTTTCCTGCCATGCTTCAAGCTTACTT 
TTCCATCCTTCCTGATGGTAGTACTAACTACATTTGTAGACCATTTACGTGGTCAACACT 
GTGCTAAGCTGTTAGCTTCATTCTCTATGAGACAGGCACTCTTAGCCCAACTTTACAATT 

TCTGTCTGGCTTTTCTCAACCTTTCTCTTCTGCACTTTCTTGGATATAATCAAAGCACTA 

CCAGGAACTCCAGAGTCGGCACCTTTTCATTTTTGTGTTTTCATTTAATTATTTCTCAGC 

TGCTAAGTGTTTGACTGTTTAAGGGACTCTAGTGGTAAATATTTGTCTTTAGCCTGGCAG 

AAGCTGTGGTTTCCTTTGATGAGCTCACACGGTGTGGCTTTTAAGATGCTGCTGACCAGG 

ACAGCTGACTGTCCCCAGTGGGTGCAGTCCCCAGCAGTGGGCTGGACCCCTTCCAGAAAG 
ICT) 

GCTGCTGGGCCAAGAGGCTTCCTCCAACTTCCCGCTGCCCCCATCTAACCAACACCTCAG 
TCTCTTCTCCACCTGCTTCCCTGCCCTCTTCCTTTCCCTCGCAGACACTTTCTTCTGCCT 
GGCAAAAGGAATCTTGTTTCCATGGAAGCCTCATTAAATCTGCATCTTGCTCAGTTTGGG 
TTTGATCACGGCTGCCAGAAGTATTTTTAGCCCATGCAGTTGCGTAATGAGATAGAGATT 
GGGGAAAGGGGGAGGTGACTGTATAGGCAGAGGGTTTTTTTAAAAAAAAGTGAGAAAGAG 

ACTTTCTTGGATATAATCAAAGCACTACCAGGAACTCCAGAGTCGGCACCTTTTCATTTT 
TGTGTTTTCATTTAATTATTTCTCAGCTGCTAAGTGTTTGACTGTTTAAGGGACTCTAGT 
GGT AAAT AT T TGTC TT T AGC C TGGC AGAAGC TGTGGTTTC C TTTG ATG AGCTC AC ACGGT 
GTGGCTTTTAAGATGCTGCTGACCAGGACAGCTGACTGTCCCCAGTGGGTGCAGTCCCCA 

GCAGTGGGCTGGACCCCTTCCAGAAAGCGCTGCTGGGCCAAGAGGCTTCCTCCAACTTCC 
[C,T] 

GCTGCCCCCATCTAACCAACACCTCAGTCTCTTCTCCACCTGCTTCCCTGCCCTCTTCCT 
TTCCCTCGCAGACACTTTCTTCTGCCTGGCAAAAGGAATCTTGTTTCCATGGAAGCCTCA 
TTAAATCTGCATCTTGCTCAGTTTGGGTTTGATCACGGCTGCCAGAAGTATTTTTAGCCC 
ATGCAGTTGCGTAATGAGATAGAGATTGGGGAAAGGGGGAGGTGACTGTATAGGCAGAGG 
GTTTTTTTAAAAAAAAGTGAGAAAGAGAAGGAAAACCTCTAAAGAAAAGAGTTTTATGGA 

CTTTCTTGGATATAATCAAAGCACTACCAGGAACTCCAGAGTCGGCACCTTTTCATTTTT 
GTGTTTTCATTTAATTATTTCTCAGCTGCTAAGTGTTTGACTGTTTAAGGGACTCTAGTG 
GTAAATATTTGTCTTTAGCCTGGCAGAAGCTGTGGTTTCCTTTGATGAGCTCACACGGTG 
TGGCTTTT A A G ATGCT GC TG ACC AGG AC AGCTG ACTGTCCCCAGTGGGTGC AGT CC CC AG 

CAGTGGGCTGGACCCCTTCCAGAAAGCGCTGCTGGGCCAAGAGGCTTCCTCCAACTTCCC 
[G, A] 

CTGCCCCCATCTAACCAACACCTCAGTCTCTTCTCCACCTGCTTCCCTGCCCTCTTCCTT 
TCCCTCGCAGACACTTTCTTCTGCCTGGCAAAAGGAATCTTGTTTCCATGGAAGCCTCAT 
TAAATCTGCATCTTGCTCAGTTTGGGTTTGATCACGGCTGCCAGAAGTATTTTTAGCCCA 
TGCAGTTGCGT AAT G AG AT AG AG ATTGG GG AAAGGGGG AGGTG ACT GT AT AGGC AGAGG G 
TTTTTTTAAAAAAAAGTGAGAAAGAGAAGGAAAACCTCTAAAGAAAAGAGTTTTATGGAA 

TATTTTTAGCCCATGCAGTTGCGTAATGAGATAGAGATTGGGGAAAGGGGGAGGTGACTG 

TATAGGCAGAGGGTTTTTTTAAAAAAAAGTGAGAAAGAGAAGGAAAACCTCTAAAGAAAA 

GAGTTTTATGGAATTGGAAGAAGGATGGAGCACCTCTTTTGGGAGCATGAGGCTGGTGTT 

CTCTGGTTAGCTCTTCCCACTGGAAGCCCATGGACACTTGCCATAATACCTGTCCTGGTC 

ACATGTCAGGGGAACCTCTGATCTCCCTTTCCATGAGCTTAGTTGGCCCAGCCAGGGTGA 
(C,T] 

ACTTATGCTAGGGAGTGTGATTGATGTTGCTGCTTACAGATTTCCCCTCCCACAGACCTG 
ATGGGGCAGCCAGGATAGTGGCAGAGAAGAAGACAGAGCAATAGCAGGAAAGAGAGGACA 
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AC AC T AAC AC ATTGG AGGTTT ATGTTC AAAG ACGGG ATCT AGGGGGTC AG AG AAAGC AC A 
CCTACCATGTAATTGGTGCTGGAATCTGATGCCAAGTGCACCCTTGGCTTCTGAGGTTCT 
GAGAACTCTTGCTTGTGCTTTTCAGCCAGACTATGCCCTCACCTGCCCCTGTACTTTAAA 

72992 TGTTTGC ATTGGATTGTTGGAGTGTGTGTGC ATGTTGTTGTGTTCTTGT ATTAC AAGAC A 

AAG AG ATT AAAAAAAAACC AC ATGC AGC TGTC AC AGC T AATG TTT ATTG AAC TT T TACT A 
TGCCACATGGTGTTTTAAGCATTCTATATGTGTTAACTCATTTTCCCTAATTCTATGGAC 
TAGACACTTAAACAGTCTCCATTGTACAAACAAGGAAACTGAGGCACAGAGAGGTTGGGA 
AACTCATTTGAGGTCCTCCAGCTAATTAATAGTGGAGCCAGGTTTTGTACCCAGACAACC 
[T, G] 

GATTTGAGAATCTGCAGTCCTAGATTAGTAACGTGTTGTTGGCCTGTCACACATTTTAAA 
TGACATTCTGTACACAGAACCATTTATAGTAACTTTGTATTGTTGAGCTGAAAGCAGTCT 
GCAGATGTGCTGCTGGGATTTCATTCATCTTCAAAGAGGTGTTTTTTTTTTTTTTTAAAG 
GAAAATGCTTTTCTGAGGGTGGTATCTAAATTCATAAAAATCTTTACGATCAAGATTTTC 
ACAAATTTCATTCTGACTCTGTTGCATTGCCCTTCTTCCCATATTCCCAGTTAGTTTGTA 

73154 TTCCCTAATTCTATGGACTAGACACTTAAACAGTCTCC ATTGTACAAACAAGGAAACTGA 

GGCACAGAGAGGTTGGGAAACTCATTTGAGGTCCTCCAGCTAATTAATAGTGGAGCCAGG 
TTTTGTACCCAGACAACCTGATTTGAGAATCTGCAGTCCTAGATTAGTAACGTGTTGTTG 
GCCTGTCACACATTTTAAATGACATTCTGTACACAGAACCATTTATAGTAACTTTGTATT 
GTTGAGCTGAAAGCAGTCTGCAGATGTGCTGCTGGGATTTCATTCATCTTCAAAGAGGTG 
f-,T) 

TTTTTTTTTTTTTTAAAGGAAAATGCTTTTCTGAGGGTGGTATCTAAATTCATAAAAATC 
TTTACGATCAAGATTTTCACAAATTTCATTCTGACTCTGTTGCATTGCCCTTCTTCCCAT 
ATTCCCAGTTAGTTTGTATTGATTGCTGCATCTCCCTTGAGCCCATGGTCCCCCACAACA 
TTTCTTGCAGAACTGTGTCCTGCCTTCACACTGTCAGGCAGCAGGAGCCTCTCTAGCGGC 
CAGCCCACAGTCCTGCAGCTCCTTCCTCAGGACGTTTAATTTCCCACATTTCTATGCAGT 

73164 CTATGGACTAG ACACTTAAACAGTCTCCATTGT AC AAAC AAGGAAACTGAGGCACAGAGA 

GGTTGGGAAACTCATTTGAGGTCCTCCAGCTAATTAATAGTGGAGCCAGGTTTTGTACCC 
AGACAACCTGATTTGAGAATCTGCAGTCCTAGATTAGTAACGTGTTGTTGGCCTGTCACA 
CATTTTAAATGACATTCTGTACACAGAACCATTTATAGTAACTTTGTATTGTTGAGCTGA 
AAGCAGTCTGCAGATGTGCTGCTGGGATTTCATTCATCTTCAAAGAGGTGTTTTTTTTTT 
l~,T] 

TTTTAAAGGAAAATGCTTTTCTGAGGGTGGTATCTAAATTCATAAAAATCTTTACGATCA 
AGATTTTCACAAATTTCATTCTGACTCTGTTGCATTGCCCTTCTTCCCATATTCCCAGTT 
AGTTTGTATTGATTGCTGCATCTCCCTTGAGCCCATGGTCCCCCACAACATTTCTTGCAG 
AACTGTGTCCTGCCTTCACACTGTCAGGCAGCAGGAGCCTCTCTAGCGGCCAGCCCACAG 
TCCTGCAGCTCCTTCCTCAGGACGTTTAATTTCCCACATTTCTATGCAGTTACCTCACAG 

7 414 9 TTTGCTCAAGGTCACATAACTAGTAAGTGGGTGGAGCTGTGATGTGAAACTGGGCAGTCT 
GATTCTGGGACCTGTGCTCTTAATCACCAATCTATATTGCCTCCTACTTGAAAACATCCA 
GGGAAAATGTTGAGATAGATCAGCTGAAATCTTCTTGCACAGTAAAGCAGGGGCCACCTG 
TCCTGGAGTTACATTCATCTTGTTCATTGTCAACGATTTGTGTTCAGTGACACCCTCTTC 
AGCCCAAGAACTTACCTGGGTGCTGTGACAATTGGACATGACTAGGAACAACCAGTGACA 
(T, A] 

TGTAGCCCATCCAAACACAGGGTAGGAAGTGGATGCTTGTCACTCTCTTTTGGTTATAAG 
AAGC AG GAACCC AGT AAAG GC AC C TTT T AT AT ATCT AT AAAGTTG AAT AT AT AAG AT AT A 
TGGGGGCCAGGCACAGTGGCTCACACCTGTAATCCGAACATTTTGGGAGCCCAAAGCAGG 
TGGATCACCTGAGGTCAGGAGTTCAAGACCAGCCTGACCAACATGGTGAAACCCCATCTT 
TACTAAAAATACAAAAATTAGCTGGGCGTGGTGGCACACACCTGTAGTCCCAGCTACTTG 

74171 GTAAGTGGGTGGAGCTGTGATGTGAAACTGGGCAGTCTGATTCTGGGACCTGTGCTCTTA 
ATCACCAATCTATATTGCCTCCTACTTGAAAACATCCAGGGAAAATGTTGAGATAGATCA 
GCTGAAATCTTCTTGCACAGTAAAGCAGGGGCCACCTGTCCTGGAGTTACATTCATCTTG 
TTCATTGTCAACGATTTGTGTTCAGTGACACCCTCTTCAGCCCAAGAACTTACCTGGGTG 
CTGTGACAATTGGACATGACTAGGAACAACCAGTGACATTGTAGCCCATCCAAACACAGG 
EG, A] 

TAGGAAGTGGATGCTTGTCACTCTCTTTTGGTTATAAGAAGCAGGAACCCAGTAAAGGCA 
CCTTTTATATATCTATAAAGTTGAATATATAAGATATATGGGGGCCAGGCACAGTGGCTC 
ACACCTGTAATCCGAACATTTTGGGAGCCCAAAGCAGGTGGATCACCTGAGGTCAGGAGT 
TCAAGACCAGCCTGACCAACATGGTGAAACCCCATCTTTACTAAAAATACAAAAATTAGC 
TGGGCGTGGTGGCACACACCTGTAGTCCCAGCTACTTGGGAGGCTGAGGCAGGATACTTG 

74 918 TAACAGGTGCTGAAAACAGGAACTGGGAAGTTGCCAGTACCTTCCTGTCTTTTCCCCTGG 
AACCAAACGGTTTCTTACTTGCTTCTCTCTGCACCTCTGTCTCATTTCCCTCTCTCTTCA 
GATGATTTTTCATTGTTGCATCACACACATAGAAAAATCAGGATCCACCCTCCCAAGTTT 
ACATATCGTTGTTTCAGGCAGCCATAGTATCCTTAAAACTCCACATTCCAGGGAGAAAGC 
TT GGGT C AAGG ATT C AGCC AAAGGG CAGCGAAATGG AGT AAAG ATGC AACTGCCAGGTCT 
(A,G) 

TGGGCAGCAAGGAGGCCGGGAAGGAAGCCGCTGTTGTGGTCCAAGTGACAATTCAACAGC 
TCAAAGCATAAGTAAGTTGTGTGCTTTTCACAGATGGAGAAACTGAGGCACAGAAGGAAC 
CTGGCTGGGGTCCAGGTCTCTGGCCTTTGTGTCAATGCTAGGTCACTGGATGTGGCGTCT 
GATTTCTACAGGAAATGTGGTTTCTCTACTTTGTCCCAGAGCCCACTCAGAGCACTGGCT 
GGCCAGGGGGTCCTAGGGCCCTCTTAGGATAGTCTCAGGCCAACAGCCCCAGGACAGAAG 

75386 GGATGTGGCGTCTGATTTCTACAGGAAATGTGGTTTCTCTACTTTGTCCC AGAGCCCACT 
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CAGAGCACTGGCTGGCCAGGGGGTCCTAGGGCCCTCTTAGGATAGTCTCAGGCCAACAGC 

CCCAGGACAGAAGCAACCAAAGTGAAGTTATGAAAGAAAGCTCTTTGCTGATCTGTCAAT 

GGCACCCTTGTAGAGCCAATACTTAGAACACCTGGATTTGAATACTCATCTCCAAAACCT 

GTGTTCTTTCTACCACGTGACAAGCCCTTGTAAACCTCACAACGTCTCTATGAGGTGAGC 
IG,A1 

CTT GC AG AT CC AC AC TTT AG AT AAGCAAAT GG AGGCTC AG AGGGT AAGC AGCT AGTTC AA 
GGTTATGCACCTGAGCCAGGATGTGGACACAGCTCTGTGTCTGATTCCTAAGGGCCTGTG 
C T TT AGCC ACT T TGC AATAC TGC TGCTGTC TGCTTC ATTTCCTC ATC TGTC AG ATGGGAA 
CGATAATACTCAACTCACATGGATACTGTATGAGGAAAAACAGATAAAAGAAGAGAAAGT 
GCTTTGAAAACATAAGCAGCCCTGGCAGATGGGAATTATTTTTGCTGCTGACACACATCC 

CAATGCCAATGTTTCTGAAGCCCATATTAAATGCCAAAATCTGAGTCAGCTACTGGAGGT 

AGAGACATGAATAAGATGGTCCATATTATTTTAGAGGATTCTTTGGTTGCAAAGGGCAGA 

CACCCAGCTTGAATTCACTTTGGAGAAATTGGGATTTTTTTGGCTTGCATAAGCAAAGCA 

TGAGAAAGAAAGTTCCAGGGATGATGAAAACCAGGAATGCAAATGTCTCCAGAATTCTTT 

CTTTTTTCTTTTAGGCCATCTTTTTTCTCTCAAACTGGTTCCCTCCACTGGGCTGGAGAC 
(G/ A] 

TTACTACCAGCAGCACTCAGACCCACATCTTCAGTTTAAATGTTGGAAATGGACTGTCAG 
AGAACATTTAGGCCATTCATTCTGTGGGAGAGATAGGCTATGTAAAAAGATAGCCACTCC 
CATGTGAACAATGTGGTTAGGATTAGAGGCATGAATATACCCCAAACCAGGGGTGTGGGA 
AGGAGGTTGACACTCTAGGTGATAATACCCAGACCTTAAGGAGCTTTCTGTCTAGAGGGA 
GGTATGGACATGGACAAGTAATCAACAGCTACAAAGCAGAGCTGCCAGCTCTGCAACACA 

ACCTTAAGGAGCTTTCTGTCTAGAGGGAGGTATGGACATGGACAAGTAATCAACAGCTAC 

AAAGCAGAGCTGCCAGCTCTGCAACACAAGAGCCCTGAGAGGCATGACAGGGGCAGGGTG 

GGGATCCATGTGGGTCTGGATTGAAGTGAGGAGGGGCATCAGGAAAGCATTCCAGGAGAG 

CTGAGGGACACTTGAGCACACCCTCAAAGAATGACTGGGGGTCATGAGGTATACAAGGGA 

GGAAGTGCACCCGAGACAGAAACAATCACATAAGCAAAAATGCAGAAGAATATGAGGATC 
(G, T] 

GGGAAGGGC AAGT AGCTC AGT AGT GTT GG AGGCC AAGGG AC ACG AAG G AAGGT G AT AAAG 
CCCTGATGTTAAGGATAGAAAAATCAAAGTCCTTTGAAAATCATGTGGAGTTAGGATCTC 
AAGAACCCTACAAGGATTTCTTTAGAATAGAATCAAAGAAAAACAAAGTTTACAGTCTGT 
GAGGGTTGCATAGGAAGTAACGTGGTGAGAAATGTTGGCTTGAGAACCACATATCCATAA 
CACAATGGTGTTTTAGAGGATTTGGGGGAAGGGAGAGAAAATCTCAAATTGTCTCAGTAA 

GCATCATATTGCATGAAAACAGCAAACGGAAGTCACAATGGCTCGACGGTGTAATGAAGC 
CACACAATATGTATTAAACACATCATCTACACAGATGGATTCAAAGATACCTTCTTTGTG 
TCTAAGTCCCAAATCTGTGTTTCCTGGCTCTGTTCCCTCATATCTAGTCATTCTCCAAGT 
CAGCATGCCCAACTTGAAAGTGTCATTTTCAAAACCTGCTTCTTCTCTTCTGGAAGTTCT 
TCC TCTGCCC AT TGC TCC AC AATCCCC ACC TCTT TC ACCCAGT AGC AAACCTT AAATTT A 
IT, A] 

CTTTTACTTTGTCTTACTTCCCCTTCTTATATTCAAAATGTTTCTCACTTGCATCTCTTT 
TCATTCATTTCATAAGCATTTATGAGCTCCTGTTATGGTTTGGAAACTGTTCTTCATGCT 
GGAGGTGGTCTTATAAACAAGTAATTTCAATTGAGTATTTAGTATGTTAAGTGCCATCCC 
AAAGGCAAACACCAGCTGTGGGAGGCTCCCCAAATCAGTCTAAGGAAGTTGGGAAAAGCA 
TCTCAGAGAAGATGGTGTCTGAGATGGGGAGGATGTGTGGAACTGGGCAAGGAAGAGAAC 

TTTGGGCAATTGTAGCAATTTTAAAACTATGTTAGATGGCTAGAGATTCTTGAGAATATT 

TCTTTTCTTGGAAAATCATAAGGCTTTGGATAGTGGTACCTATAGAAGCTGACATCAGCA 

GCAGCCTGCCTCCAGTCGATCAGGGCCTTTGGAACTTCACGGGGCTCCTCTACTGACAGC 

CCCATCGGTTTCCCTCCAGCACACGTAACTCAGCATTGACTCTGGGTAGTAGAGGGTGGT 

TTATGGAATCTGATTCATCTCAGAAAGAGGTGGATGCAAACACATTCCCAGAGCAGAAGG 
tC,T] 

T TGGC ATGTC TGG TC TT AGGC AG AGGG AACTGG AG AT ACTTGTCCT ATTGTTCTTG AGAT 
TCCAGCAAAAATAGCCCATTACAGAGGAAGAAGATATCAGGTCAAATGAAGGCTTTGGTG 
CT AC AAC AT TGTC T TAG AAAAAAAAAG AAAG AAATTGGCCAAGTGCAGTGGCTC AGC ACT 
TTGGGAGGCTGAGGGGGGCAGACCACTTGAGATCAGGAGTTCGAGACCAGCCTGGCCAAC 
ATGGCGAAACTCCGTCTCTACCAAAAAGTATTAAAAAATAGCCGAGTGTGGTGGCGGGCT 

CCTTGGGGC ATC AC ATT AAGT AGTT AC C AG ATTG AACTGC AAAC ATT GCT ATCC AGG AG A 

AATCAGGTCAATATTTCACCTTCATGGCAATACCAGTACAGTCCAAGGAGAATGCATAGA 

AGGAAAGAAATCATAATCTGATTGTATGTGTTTTTTTAGTAGTAAATAATAATAATTATT 

ACTATTCCTATACAATTTTGTGTGTTGGTGTGTTTTGTTTTGTTGTGCATGAAAAATGGG 

GTGCTAATCTATTCCCCTTCCCAACACCAGTGCTCAGAAGAAATTTCCACAGATAGAGAA 
[G,T] 

CTATAGGTTATGAATTTGGCCTTGATGGATTCTGGGTCACTATTTCTCAATGTTTGTCCA 
TGTCATGTGAAGCTCTTAAGATAAAGAACAATGTCTTACTCGTCTTTTTAACTTCTTTAC 
CCCCTAATGCCTATCACATACTTTGCCCATGGAAACTCAATAGACATTTGTAAATGGAAT 
TTAATTTCTGAGGTCCAGTAAAGCCTTTTTCCATCCTTCCCCTACTACACAGTTTGTCTA 
ACCATGTCTTCCCTTCCATCATCCACCTTATAAACGTTATTACTCATTCTTCCATCACAT 

CTCCCTACCTGTCCCTCCTTGACCCCAGGAAAAATTGCCGGGATATGAAAGTTAATTATG 

ACCCAAGGGAATTGGTACAGATGGGGAAGAAAGAAATGCATTCAAGAGCATTTCCATCAG 

TATTGAAATTACACAGAAGGCTGGTGAATTTGGGCTATCCATTCTTGCCTCCCTCTGTGC 

CCATAATTCCTTGGCCTCCTTCAATTTCATTTTCCCTTTGGTTCAGAGGAATGCTTGATG 

GCTTAAGCTAGCCTCAGTTGGCCAAGCATTGGAGAAACAGAGAGGTGTATGACACAGCTA 
[C,TJ 
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ACTCCCATGGGGCTTACAGGGCAAGGTGAGAGAAGACAGAAGTTGTATGTGCTGGGTGCC 
ACGTGGTAGCTACAAACTAGAAATGAGACCAGGTTCGGAAGAGGAAGAGGGCTTGCAGAC 
C TG AGTC AT GG GG AC AGT TT CTTC AGG AAATGGG AT CTC AG CTC T GCCTT GT AT GC AGGG 
CTTACATAATAAATATGTTTCATTGTTGTTGTTGTTATTGTTGATTTAATAAGATTTTGT 
TTTAAGAAGATTTTGTAAAAACAACTGAACAAATGCAATCTCCTGCCAGAGCAGGCAGCA 

GCAATTGGTACAGATGGGGAAGAAAGAAATGCATTCAAGAGCATTTCCATCAGTATTGAA 
ATTACACAGAAGGCTGGTGAATTTGGGCTATCCATTCTTGCCTCCCTCTGTGCCCATAAT 
TCCTTGGCCTCCTTCAATTTCATTTTCCCTTTGGTTCAGAGGAATGCTTGATGGCTTAAG 
CTAGCCTCA GTT GG C CAA GC AT T GG AG AAACAG AGAG GTGT AT G AC AC AGC T A C A CTCCC 
ATGGGGC T T AC AGGG C AAGGT G AG AG AAG AC AG AAGTT GT ATGTGC TGGGTGCC ACGTGG 
IT,C) 

agctacaaactagaaatgagaccaggttcggaagaggaagagggcttgcagacctgagtc 
atggggacagtttcttcaggaaatgggatctcagctctgccttgtatgcagggcttacat 
aataaatatgtttcattgttgttgttgttattgttgatttaataagattttgttttaaga 
agattttgtaaaaacaactgaacaaatgcaatctcctgccagagcaggcagcagcaaagg 
agattaggaatataacccccttggagacgttccttcacctacctggtgctggattaccta 

87866 tgc attc aagagc atttccatcagtattgaaattacacagaaggctggtgaatttgggct 
atccattcttgcctccctctgtgcccataattccttggcctccttcaatttcattttccc 
tttggttc aciaggaatgc ttg atggctt aagctagcctcagttggccaagcattggagaa 
acagaga^rrctatgacacagctacactcccatggggcttacagggcaaggtgagagaag 
acagaagttctat.;tgctgt^tgccacgtggtagctacaaactagaaatgagaccaggtt 

fC,T) 

GGAAGAGGAAGAGGC TTTG'1A.*ATCTGAGTCATGGGGACAGTTTCTTCAGGAAATGGGAT 
CTCAGCTCTG .% TT iT A T G : A CO" 1CTTAC ATAATAAATATGTTTCATTGTTGTTGTTGTT 
ATTGTTG ATTT AA T AA , A TTTT JTTTT AAGAAGATTTTGTAAAAACAACTGAACAAATGC 
AATCTCC TOT C Ai iAG IA ViC A.. -7 AGC AAAGGAGATT AGGAAT AT AACCCCCTTGG AGACG 
TTCCTTC ATCT A" ; TGGT^TT J- ^TTACCTAAAAGCTTCAGCTAAGTAGGGTCACCCCCC 

88238 CTTGTATGC A. TTAC ATA/. ? AA *TATGTTTCATTGTTGTTGTTGTTATTGTTGATTT 

AATAAG ATTTT. TTTT AAGAA . .A TTTTGTAAAAACAACTGAAC AAATGCAATCTCCTGCC 
AGAGCAGGC AGC A V:AAAGGa,.A rTAGGAATATAACCCCCTTGGAGACGTTCCTTCACCT 
ACCTGGTGCT'^^TT AC I T AAAA' -TTTCAGCTAAGTAGGGTCACCCCCCCAAGAAATTAT 
TTTAAAAAAA TTGAAATCTCA T A TTTTTAGAAAATCTTATCAAGGATATTT AATTGGACT 
[A,C] 

T TT AC ACCT A TTT AC#G< iT C AG T r GGTTTTGGAC AAGT AT GC AGGGGTCT TGG AATC AG AC 
C ACTGGGGTC AAA T C* T Ai rTTCT G TCACTTCCTAGCTGGGTGACCTTGGACAAAGTTACC 
TGACTTCT A AT AGCTTC AGATT J CTCATGGGCAAAATAGAAATGCTACTAGTACTTAATA 
GTGCTCTGAGAAG.3ATTJAATGAGAAGGATTAAATGTATGTAAAGCACAGTGTTTGCCCA 
T AGG AAGCTGTT A TTT ATAA GGG AGGGG AGC ATCCT AAGGTCCTCCG AATTT AGG AGAAC 

8921 9 AGAGGGCAGTTGGAAAACTTC ACAAGACAATCCAGCCTGATTGTTTTGACATGCCTGACT 

TCAGGCTGCTAAAAATCAGCTCGAGGAATCAGATAGGAAAAAGAGATAGGTGATGCAATT 
TTATTCCATCTCCCAATTTTCTGAGTCAAGAGTTGTTTGTTTAACTCCAGTTAAATTAGT 
ATTTATCCAAATTTCCTGGGTGCTTGTCCAAAGAAAAGTACCCCAGATCTACAAATTAGA 
ATCTGGGACTGGGACTTAGOAATTGGCACTTTTACAATTATACCAGATGTTTCTAATATG 
[A,GJ 

GTACTTCAACCACTACCCTTATAGAAGTGCTGCCTAGGACCCTCTCTTCTGGCAGGTGAA 
GTGGAAGGAGGTTTTCTCGAACGGAGATTCTCCACTTCAACTTGAGTGTCTTGGCTTGTA 
TCCGCTTTGTTTGGTTCTATTTCACCAAAGGCTTTCATCTTCACATAAATTTTCTTCAGC 
TTTAAATAATTAGTTTTGGTAACCATTGGTATACTGGAAAGAACATTAGATTTGGAGTCC 
AGGTGGCTTGAGTTCAATTCTCTGCTCTGCCATTTACCAGCTGTGTGACATTGGGCAAGT 

89331 ATGCAATTTT ATTCC ATCTCCC AATTTTCTGAGTCAAGAGTTGTTTGTTTAACTCC AGTT 

AAATTAGTATTTATCCAAATTTCCTGGGTGCTTGTCCAAAGAAAAGTACCCCAGATCTAC 
AAATTAGAATCTGGGACTCGGACTTAGGAATTGGCACTTTTACAATTATACCAGATGTTT 
CTAATATGAGTACTTCAACCACTACCCTTATAGAAGTGCTGCCTAGGACCCTCTCTTCTG 
GCAGGTGAAGTGCAACGAG G T7 TTGTCGAAGGGAGATTCTCCACTTCAACTTGAGTGTCT 
[T,C) 

GGCTTGTATCCGCTTTGTTTGGTTCTATTTCACCAAAGGCTTTCATCTTCACATAAATTT 
TCTTCAGCTTTAAATAATTAGTTTTGGTAACCATTGGTATACTGGAAAGAACATTAGATT 
TGGAGTCCAGGTGGCTTGAGTTCAATTCTCTGCTCTGCCATTTACCAGCTGTGTGACATT 
GGGCAAGTTGCCAACCTATCTATGTCATTTCCTCATGTAAAGATAATCCCACTTCACCAG 
GCCACTTTTGAGGACCCAGTGAAATGATGTGTAACCATTTTAGGAACACTGGATCATTCT 

90794 G ATTT GGC C AAG GT C AC AC AGCT AT AAGC AGT AG AACT AAG ATTTT AACT C AAGTTTCT A 

TGGCCCCAGAATTTATGTGTTTCTCTCTCCATACCACAGGGACAGGTGCAAGTGAGAGAT 
TTTGCTGGAAGCACTGGGCTTTTTGAGCAGGCCATATAAAAATTCTGAGCCCAGAGCTCA 
ACTAAATTATTGGAAGAGACTGGGCCAAATATAAGGCTTCTATCTAAGCAGCACCTGTGT 
TTCTCAAGGACTGAGGAAAATGAAGGGGGAGGGTTGGCAAGGCTGCATTTCCCAGGGTGC 
[A,G] 

TGATTATATGGCATGGGGGTGGGGGCCATTATGATGCCCGGACATGGAACTTACACCAGT 
GCAGAAAGGGTGTGATTAGAAGCCCTAAGCCAGAGAATGTTCAGTGTGATAAATGCCATT 
ATTTTTTCCCTCATTCATTCAATAGATTTTTTTTTTAGATGGAGTCTCACTCTGTCGCCC 
AGGCTGGAGTGCAGTGGCACCATCTCAGCTCACGGTAACCTCTGCCTCCTGGGTTCAAGC 
AATTCTTGTGGTCCAGCTTCCTGAGTAGCTGGGATTACAGATGTGCACCACCACGCCTGG 

FIGURE 3, page 47 of 57 



87837 



5NSDOCID: <WO 0233086A2_I_> 



WO 02/33086 PCT/USO 1/32 152 



92404 TGCTCTGCT ACCTACCTGCC AGCTGTTTCCCAGGGATGTGGT AAAGATGAATGGGCAAGA 

TCTGGGAAAGTGTTTTGAAATCCTTGATTAAAGGCCCTCCAGGCAGATGTAGAATTTTAA 

ATGTGTTATATTACTGCCACTATTGTTATGCTTTCTTTTATCACCCCAGAATTTCACCAT 

CTCCTGTTTCAGGTGAACGAGTCTGCCTGACTCTTACCTGCCCTGAATGGCATTGGAAAG 

GTAGCAGCCCTGAGATGTGCCATATAAACAAACATGTTTTTAACCAAGGGATCAGGAGGC 
[C,T] 

TTCCTGGCTGGCTCCTGTCAGCTGGTCATCACCTCTCTATAACTCTAGGCTTTCCCAAGC 
TTATTTTATTTCCATCAATAGGACAGGAATATGTAAATGTCCTGCTTGAAATGAGTATTG 
GCTACAAGCCATCTGCCTCTGAACAGAGGTGAAAAGTGGAAATCGGAGGAAGGGCAGATG 
TCTTTTGCAAGGGAAACAGACTGTTTTCTGCCACTGCACTCTGCCCAGGCAAAAGAGTAA 
AGGAACAGCACTCAGGAGAATTCACTGAAGCGAGGGCAGGGTGCAAAAGGAACTTGAGAA 

92672 CAAACATGTTTTTAACCAAGGGATCAGGAGGCCTTCCTGGCTGGCTCCTGTCAGCTGGTC 
ATCACCTCTCTATAACTCTAGGCTTTCCCAAGCTTATTTTATTTCCATCAATAGGACAGG 
AATATGTAAATGTCCTGCTTGAAATGAGTATTGGCTACAAGCCATCTGCCTCTGAACAGA 
GGTGAAAAGTGGAAATCGGAGGAAGGGCAGATGTCTTTTGCAAGGGAAACAGACTGTTTT 
C T GCC AC T GC ACT CT GCCC AGGC AAAAG AGT AAAGG AAC AGC ACTC AGG AGAATTC ACT G 
[A,C] 

AGCGAGGGCAGGGTGCAAAAGGAACTTGAGAAATTGGTACTGGGACCCAAAATCAGATTC 
TGGCATTTCTGGGAAAAGAAATGGGCATGGGTGGGGGTTTTATCTGTCAATAAAAGCATC 
CAGAATGGGGCTAGAAGGAAGTAAATTCAGTTGCCACCTCTGCCTACTGGACAGCCACGG 
AGAACTTCTCCTTATCCAAGGTCGAGGAGCCCTCCGGAGTACATACTGATACCATTGGTT 
CTCCCACACATACCCCCATGGAGATAAAAACAGGACCCTGGAAGCCCTGTCCGTGTTTAA 

92684 TAACCAAGGGATCAGGAGGCCTTCCTGGCTGGCTCCTGTCAGCTGGTCATCACCTCTCTA 

taactctaggctttcccaagcttattttatttccatcaataggacaggaatatgtaaatg' 

TCCTGCTTGAAATGAGTATTGGCTACAAGCCATCTGCCTCTGAACAGAGGTGAAAAGTGG 

AAATCGGAGGAAGGGCAGATGTCTTTTGCAAGGGAAACAGACTGTTTTCTGCCACTGCAC 

TCTGCCCAGGCAAAAGAGTAAAGGAACAGCACTCAGGAGAATTCACTGAAGCGAGGGCAG 
[A,G] 

GTGCAAAAGGAACTTGAGAAATTGGTACTGGGACCCAAAATCAGATTCTGGCATTTCTGG 
G A A A AG AAATGGGC ATGGGT GGGGGTTT T AT CTGT C AAT AAAAGC ATCC AG AATGGGGC T 
AGAAGGAAGTAAATTCAGTTGCCACCTCTGCCTACTGGACAGCCACGGAGAACTTCTCCT 
TATCCAAGGTCGAGGAGCCCTCCGGAGTACATACTGATACCATTGGTTCTCCCACACATA 
CCCCCATGGAGATAAAAACAGGACCCTGGAAGCCCTGTCCGTGTTTAACCAATGGGATTG 

93132 CTGCCTACTGGACAGCCACGGAGAACTTCTCCTTATCCAAGGTCGAGGAGCCCTCCGGAG 

TACATACTGATACCATTGGTTCTCCCACACATACCCCCATGGAGATAAAAACAGGACCCT 

GGAAGCCCTGTCCGTGTTTAACCAATGGGATTGAAACATGGAAATGAACTGCCCCACAAT 

CCACCCTGTGAGAGACCAAAGAGCAGTGTTGGATTAACAGGGAATGTTACCCTGAAAAGG 

CATTCAGCTTCCACTGGGGCAGCAGGTACAGTGCAAAGATGATCCCACTTAAATTCCTAA 
[G,CJ 

ACAGGAAATAAGGAAAGATGTTGTGGAAACTCAAGACCTCTCAAAGCATACTCCTTTGTA 
GTTCTTCCGCAGACCAGACCACGGAATTCAGAAAACACCCTACCTGGTTCCAAACCAGCA 
CCTGCCAAACTTCTCACCCTCTTCTGACCCTGTCCTGGGAGTTAAGAAAAAAAAAATCAC 
TTTATTGGTTGCTCCAGTTATAACTTAAACAGACAGACCATCATCAAATTAAGTGACATG 
TACGACTGCTTATTGTATGCCAGTTACTGTGCTGTGGGGTTTTGGTTCCATTATCTCATT 

93537 TGGTTCCAAACCAGCACCTGCCAAACTTCTCACCCTCTTCTGACCCTGTCCTGGGAGTTA 

AGAAAAAAAAAATCACTTTATTGGTTGCTCCAGTTATAACTTAAACAGACAGACCATCAT 

CAAATTAAGTGACATGTACGACTGCTTATTGTATGCCAGTTACTGTGCTGTGGGGTTTTG 

GTTCCATTATCTCATTTAATCCTCTCAAAAACCCTGTTAGGTAGGTTTTATTATTGCACT 

CATCTTAGATTAAGGAAACTGAGGCTCATAGAGATTCGGTAATTTGTCAAAAGCCCTAAA 
CA, TJ 

CATAATTACTGCCTCCAGATGTCTCTGATTCTAAGGCCCAGGCTCTTAATCAGTAAATGA 
TCAAATGAATAATGATTTTCATGGCATCTGTCATCGGAAAGAACAATGGAGAATATGCTT 
A ACC AAA GTC A TAACC AAATAAATG AAC T TGAC AGC AG AGCCGTGATTCT AGCC AAG ATG 
ACTATTTTCATGCATGTTTTGAAGGCCAGGAAAAGGAGGTTAGACTTGTTTGGGAAGGGA 
AACAGGAGCTATCAAGGTGAACTTTTCCTAAGAGTAGCCCAATAATAGTGCTCGGGAGGG 

93557 CCAAACTTCTCACCCTCTTCTGACCCTGTCCTGGGAGTTAAGAAAAAAAAAATCACTTTA 

TTGGTTGCTCCAGTTATAACTTAAACAGACAGACCATCATCAAATTAAGTGACATGTACG 

AC TGC TT A TTGT ATGCC AGT T ACTGTGC TGTGGGGTTTTGGTTCC ATT ATCTC ATTTAAT 

CCTCTCAAAAACCCTGTTAGGTAGGTTTTATTATTGCACTCATCTTAGATTAAGGAAACT 

GAGGCTCATAGAGATTCGGTAATTTGTCAAAAGCCCTAAAACATAATTACTGCCTCCAGA 
tT,C) 

GTCTCTGATTCTAAGGCCCAGGCTCTTAATCAGTAAATGATCAAATGAATAATGATTTTC 
ATGGCATCTGTCATCGGAAAGAACAATGGAGAATATGCTTAACCAAAGTCATAACCAAAT 
AAATGAACTTG AC AGC AG AGCCGTGATTCT AGCC AAG ATGACTATTTTCATGCATGTTTT 
GAAGGCCAGGAAAAGGAGGTTAGACTTGTTTGGGAAGGGAAACAGGAGCTATCAAGGTGA 
ACTTTTCCTAAGAGTAGCCCAATAATAGTGCTCGGGAGGGAGTAATGTGTGCAAGAATAG 

95067 AGAGAAATGGAAGCAGGGAGATAAATTAGGTGGTTATTGCAAGAGGCCAGGTAAGAAGAG 
AAAGTGGTTTAAGTAGGGTGGTGTGGCAGAGAAGACGGTTCCAAGCAGAGGGGGACCACG 
C TGA C A AAT AAGC GCGGGCC AC TCACGCAAGCCC AAC AAGGC AG AAGGC AG AAGGCAAAA 
GTGAAGGCCAGAGAAAACTGGACACCACCTTTCCAGAGCACAGTTCAAAGGCAATGTCCT 
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CAAAGAAGACACTCCACCCTCCTCCCATTTCCTCCCTATTGCCTAAAAATAAGAAGGATA 

GCGGCCTATGGCAAACCTTGGGCAGGCACGTGGGAGCTGAGCTCTTGCAAAGGGCAGATA 
GTTCCTCTGGTGAGAGAGAAAAGGAAGGGCCAGTGAGGAGTGAAGGAAGAGACGAACAGA 
GAGCCCGAAAGGCTGAGAACGTTGTCTGGCTTCCTGAAAGGCTTAAGGGGTTAGCTCTGG 
AGGGTGAACTAAAAGCCCTAGTTATATTAAACACACACGCACACACGCACGCACACACAT 
GCGCGCACACACACACACACATACACACAGTTGAAGGAGACCTGCAGTTTCCAAAAACAA 

96000 TATTATTGATCTTGATTACTGAGTTTTTAGGTGTACCCTTAAATGTTGCACCTCTGACTT 
ACTAGTCTCACCCTGATCCCTGTCCTGGATCTATGCCTGTCTGTTCTATATCAGCCTCTT 
GC T T TG ACC A T AAGAATAACTTCAGACC TTT AAGCATAGAGG AAAT AGG ATTTC TGTCTC 
CCTTCCCCACCTTTGTGATAATCTCAGCTTCTGCTTTTAAAGTCTATCTCCCAAGTAGTT 
TGCCTACTATGTTCCTCCCAAGGTCACTAGGTTCTGTGAAACTAGCAGCAGGCTAGATTG 
[T,CJ 

CACATTAGCACAAAGGATCCACTATTCCTGCAGCCGAGCTGGGACAAGCACTTAGGCCCA 
CTGACTCCAACGCTTCAATAGCCTGGGACCTACGTTGTCTCCAGGTGGTATAAAACAAGA 
ATTTCCCCTTTGACTGGGAGAAAAAGGGAAGAACTCTAAATTGGAAAACAGGTCATCTCG 
AATTCTCACAGGTGGAAATTTCTGACAACCCCTTTGGGACCCACAATTCAACACACCCCA 
AATGGGGACAGTAGCTAACATGCAACCTGTAGGCTGTTCTGTCATCCAGTGCCACTGTGC 

96877 GGGAATT AAGGTGGAAGGCAGGGCGTTTTGACTGCATTTGACCCAAGTCCTGAAGAGCC A 

GCTCCTCTCTCTTCCTAATTATTAGAAGGTTTTGTTTGGACCCAGTGTTTCACGTGTATA 
CAATACAAACTTCTCTCTTTTCTACTTGGATCAAATTTGTTCTCTCAAAATAAGATTCCC 
AGCAGTGAGAGAAGACAAGACAGAGAGATCCAACATCTCTAAAGCCATGAATCAGATAAC 
CAGCCACTTGTTCTCTTCAGTGCTGGGAACAGATACACTGTTAAATAAAATGATTTTATA 
(G, T] 

ATTCTTCTCACTGCCTTTCCAAGAAGGGGATTTATCAACTTCAGGGCACAGCAATCATTT 
ATTCCCAGACTACTGGCATGCATATATATATATATTTACTTCTCTTGACTTAGAAAAAAG 
AGAGAATTGGAGTTGTGAATATTCCTGTCTCCCTCACCCCAGCCCCCTTGAAGTGAGTCA 
GGACAAACTTGGGGCCCAAATGGAGCTGTAAGTAACTGAGTCACATGCAGAGATGAAACC 
TTCACAGACCCACTGATATGGAGGTTGAAGATTAAATTCCCCTTTGAGAATAACTGGGTA 

97271 ATTTACTTCTCTTGACTTAGAAAAAAGAGAGAATTGGAGTTGTGAATATTCCTGTCTCCC 
TCACCCCAGCCCCCTTGAAGTGAGTCAGGACAAACTTGGGGCCCAAATGGAGCTGTAAGT 
AAC T GAGTC AC AT GC AG AG ATGAAACC TTC AC AG ACCC ACT G AT ATGGAGGTT GAAGAT T 
AAATTCCCCTTTGAGAATAACTGGGTAACACTCATACAGAGACTACTTTCAAGAAGGCCA 
GATCCTCCCTCTAATGTATAGTGCAACGTTCCTAACCCTCAGCCCACTCCGTCATACCCC 
[A,C] 

ACTCACATGAATACACACATAAGCAGTAATATAAAGCACTTCCCACCATAGGGCAGCAAA 
GAAGGAGGGAAATCTTTATTATGGAAGAGTGGAAGGAAGGAAGGGAAGGGAAGGGAAGGG 
AAGGGTAAGAGGAAGAATTCTCAGGGTGAGCAGAGGAATGACATGTTTGGGGCATAATGA 
AGATAATTGAAGTGCAGAGTTTGTATGGAAAAATTTGAAAATATCAGGTGGCAGGCCAGG 
CATGGTAGCTCATGCCTGTAATCCCAGCACTTTGGGAGGCCAAAGCAGGCGGATCACCTG 

97470 ACTGGGTAACACTCATACAGAGACTACTTTCAAGAAGGCCAGATCCTCCCTCTAATGTAT 
AGTGCAACGTTCCTAACCCTCAGCCCACTCCGTCATACCCCCACTCACATGAATACACAC 
ATAAGCAGTAATATAAAGCACTTCCCACCATAGGGCAGCAAAGAAGGAGGGAAATCTTTA 
TTATGGAAGAGTGGAAGGAAGGAAGGGAAGGGAAGGGAAGGGAAGGGTAAGAGGAAGAAT 
TCTCAGGGTGAGCAGAGGAATGACATGTTTGGGGCATAATGAAGATAATTGAAGTGCAGA 

TTTGTATGGAAAAATTTGAAAATATCAGGTGGCAGGCCAGGCATGGTAGCTCATGCCTGT 
AATCCCAGCACTTTGGGAGGCCAAAGCAGGCGGATCACCTGAGGTCACGAGTTTGAGACT 
AGCCGGGCCAACATGGCAAAACCCCATCTCGACTAAAAATACAAAAATTAGCTGGGTTTA 
GTGGCGCATGCCTGTAATCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATCATTTGAGCC 
TGGGAGGCAAAGGTTGCAGTGAGTCGAGATCATGCTACTACACTTCAGCCTGGGTGAGAG 

97518 CCTCTAATGTATAGTGCAACGTTCCTAACCCTCAGCCCACTCCGTCATACCCCCACTCAC 
ATGAATACACACATAAGCAGTAATATAAAGCACTTCCCACCATAGGGCAGCAAAGAAGGA 
GGGAAATCTTTATTATGGAAGAGTGGAAGGAAGGAAGGGAAGGGAAGGGAAGGGAAGGGT 
AAGAGGAAGAATTCTCAGGGTGAGCAGAGGAATGACATGTTTGGGGCATAATGAAGATAA 
TTGAAGTGCAGAGTTTGTATGGAAAAATTTGAAAATATCAGGTGGCAGGCCAGGCATGGT 
(G, A) 

GCTCATGCCTGTAATCCCAGCACTTTGGGAGGCCAAAGCAGGCGGATCACCTGAGGTCAC 
GAGTTTGAGACTAGCCGGGCCAACATGGCAAAACCCCATCTCGACTAAAAATACAAAAAT 
TAGCTGGGTTTAGTGGCGCATGCCTGTAATCCCAGCTACTCGGGAGGCTGAGGCAGGAGA 
ATCATTTGAGCCTGGGAGGCAAAGGTTGCAGTGAGTCGAGATCATGCTACTACACTTCAG 
CCTGGGTGAGAGAGCTTTCTTTTTTTTCTCTCACAAAAAAAGAAAAGTTCAGGTTGCAGA 

98476 TGTCCCTTTCCCTCTAGCCACAGGTAACACGCTCTCCAGGCACTGGGAAAGTGGGTAATT 

AGGAAAGCAGAGGAGTACCCATGGGCTGTGATGCCCAGTTATAAACCCAGACATTTCAGA 

ATTAACAGAATGAGCATCAAGTCCTCAAATGGGTCTACATCCATAAACATGTCCAGCAGT 

CAGCTCTTTACTGTCAGTAGAGACAAAATGTTCCTACACTTTCCCTAGGGGAAGCCACAT 

CCTCAGTAGGTTATCTCTGATGAGTCCAGCTAGTCACAGGTATGTAGAAGCTGCATGCAG 
[C,T] 

AGAGGGCTCAAAGGAGGGTCCAGAATAGATACCAAAGCAAAAGGGGAGTCTGTGCACGTT 
CTCACACGCACCCCGAAACACTCTTTTTGTTCACAAAATAGATGGTGTAGGGTAGTTCCA 
AGAGATCATTTAGCTCAGGTTCCTGCCTCCATAAAATAAATAAGCCTTCCATATTAGTTG 
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TCTGTTGCTGTGTAGCAAATTGTCAGAAACGTAGAGGCTTAAAGCAATACCCATTTATTA 
TCTCGCAAGTTCTGTATCTCAGAAGTCCAGGCAGGCTTGACTGGGTTCTCTGTCCAAGTT 

98779 AGGGC T C AAAGG AGGGTCC AG AAT AG AT AC C AAAGC AAAAGGGG AGTCTGTGC AC GTTCT 

CACACGCACCCCGAAACACTCTTTTTGTTCACAAAATAGATGGTGTAGGGTAGTTCCAAG 
AGATCATTTAGCTCAGGTTCCTGCCTCCATAAAATAAATAAGCCTTCCATATTAGTTGTC 
TGTTGCTGTGTAGCAAATTGTCAGAAACGTAGAGGCTT AAAGC AATACCCATTTATTATC 
TC G C A AG TTC TGT ATCTC AG AAG TCC AGGC AGGCTTGACTGGGTTC TC TGTCC AAGT TCT 
[C,T) 

GTGAGACTGAAATCAAGGTGTTGGCCAGGCTGGGATCTTATCTGGAGGCTCTGAGGACAT 
ATACGCTTCCAACCTTATTCAGGCCATCAGCAGAATCCCGTCTCTTGTGGCTTGAGGTTG 
G A GG T C C CC GTTTCCTTGCT GGC TGTC ATCC AGGG ACC ACT CTTTGC ACC T AC AGGC TGC 
CTATGTTCCTATTCACAAGACACCGTTCATCTTCAAACCAAAGCAGCATGTAGAATCTTT 
CTTGTGCCTCGTATCTTTCTGGCTTTCCCTTCTTCTTTAGCCAGAGAAAGTTCTTTGCTT 

99218 CTGGCTGTCATCCAGGGACCACTCTTTGCACCTACAGGCTGCCTATGTTCCTATTCACAA 
GACACCGTTCATCTTCAAACCAAAGCAGCATGTAGAATCTTTCTTGTGGCTCGTATCTTT 
CTGCCTTTCCCTTCTTCTTTAGCCAGAGAAAGTTCTTTGCTTTTAAGCGTTCATGCGATT 
CAATCAGGCCCACCTGGATAATGTCCCTATTTTAAAGGTAACTGTGATACCGTATAACAT 
TTCAGGACTGATAACAGCACATTTACAGGTTCCAAGGATTGGGGCAGAACATCTTTGGGG 
(C,G1 

aacattttasaaactctgcctccccactcacccataatccttttaaaaaccaaatcttga 
agcc tttttttc z c aaa zcz ctttttg aat aagc ac att t at acct aacttc at c agac a 
ccc actttcav ic aaa : act agc atgtggc aaaat aggctgt aaatc aatc agaact attc 
tttcccaccar.natctrtctcaaacacattgggagaatctgacactgtcagtggtatacc 
agagcacia ct'j ct a? "a t ttc a 1 aagagctgactgtt aaat gtt tagtaattgt gg ac at 

100538 tgtaactattct:aa-.*taattt^ju>jvtgtggtttctagatctctcatcatcctagtcac 

CCTACTCTiX^ T'.r A rrc :; AAA(.TC C CTCTCAAGATATAGTGTCAGAATTGACCTAATTA 
GTCCAGCATTT.tArT "AA-*i^GCTA^ACTTTGACTCCAGCCCCCCATCCTTGACTGGCACT 

agcattc AAcr >;tt rr.c rrr-rrc cctgggtctttaatagagtcagagcgacttctcc 

AG GG G ATCTTTT'V »C ~ A 7 CCA C r ACT AGC ATC C AC AC AC GCTGGGGC C T TGTT AAAAAGG 
[C,A] 

aggctctcag^':c z :a.~ rcca r-atctactgaatcagaatccacacattaacaagatgctt 
gggtgattcar;:tcca;a7taaagtttgagaagcaccgctttcagggacgagatgacaca 
cttattttaaar.aoaacsccaattagagaccctaagccttctcatggaacaggggccttc 
ccctcagacctt^or,ac»a0ggctcagggaaatatcagtgttgggttgttggtgacaggtg 
gcggtggggc vtt jactu'c accttc aaagagccagaaacctggcaggggaagagatgggg 

101045 gg aaat atcactgttc^gtt gttggtg ac aggtggcggtggggggttcagtcc acgttca 
aagagccagaaacctgr.caggggaagagatggggcagtgacacccaaccggaaaaataaa 
ggaaactacaagaagaacccagctaagagatgtgaggcttctgaaagctcccatggaaag 
gttcgcagctcctcca:ctgctcggtccagctgccccaggtcaaggaagctctgtgagtg 
ttagctgaccc&3agcagcaaggatacattcagaagtgatgaaagggaacgcttcttgac 

(A,C] 

GGGTAAAGAGTCATTCACTAGGAATGAGACAGGAAGAGGTCACAGAGTCAGAAGCCCAGC 
CTGTACTCAGAGATTATTTCTGGCATGGGAGGGCCGAAGGGTTAGGAGGCCACCTACTCA 
C AAT AC AAT AC AGA 3GC AS ATCCACTT ATT ACCTGCCTGTGCTGCTGGGATTTC AGTGTG 
GAAATTCTGTGCCTCCTCACTGTGGCTGCAGCTTGGGAATGACATCCAGAGCTTACCCAC 
CTGCATAAGAAATAAGCTATAGGTGTAATAGGGGGACATAGGCTAAAATCCTAGCTCAGC 

101232 GCTCCTCCACCTGCTCGGTCCAGCTGCCCCAGGTCAAGGAAGCTCTGTGAGTGTTAGCTG 
ACCCGGAGCAGCAAGGATACATTCAGAAGTGATGAAAGGGAACGCTTCTTGACAGGGTAA 
AGAGTCATTCAGTAGGAATGAGACAGGAAGAGGTCACAGAGTCAGAAGCCCAGCCTGTAC 
TCAGAGATTATTTCTGGCATGGGAGGGCCGAAGGGTTAGGAGGCCACCTACTCACAATAC 
AATACAGAGGCAGATCCACTTATTACCTGCCTGTGCTGCTGGGATTTCAGTGTGGAAATT 
[C,G] 

TGTGCCTCCTCACTGTGGCTGCAGCTTGGGAATGACATCCAGAGCTTACCCACCTGCATA 
AGAAATAAGCTATAGGTGTAATAGGGGGACATAGGCTAAAATCCTAGCTCAGCTGCTTAA 
TAGCTGTGCGACTGAGCAAGTTACTTAACCTCTTTGAGCATCTGTTTTCTCATCTTTAAA 
ATGGAAGTAATCATAATTGACCAGGCCCAGTGGCTCACACCTATAATCCCAGCACCTTGG 
. AAGGCCG AGGCC AGTGG ATTGCTTG AGCCC AAGAGTTTG AG ACC AGC AT GGT G AC ACCTC 

101266 CAAGGAAGCTCTGTGAGTGTT AGCTG ACCCGGAGCAGCAAGGATACATTCAGAAGTGATG 
AAAGGGAACGCTTCTTGACAGGGTAAAGAGTCATTCAGTAGGAATGAGACAGGAAGAGGT 
CACAGAGTCAGAAGCCCAGCCTGTACTCAGAGATTATTTCTGGCATGGGAGGGCCGAAGG 
GTT AGG AGGC C ACCT ACTCAC AAT AC AAT AC AG AGGC AG ATCC ACTT AT T ACCTGCCT GT 
GCTGCTGGGATTTCAGTGTGGAAATTCTGTGCCTCCTCACTGTGGCTGCAGCTTGGGAAT 
[G/ A] 

ACATCCAGAGCTTACCCACCTGCATAAGAAATAAGCTATAGGTGTAATAGGGGGACATAG 
GCTAAAATCCT AGC TCAGCTGCTT AAT AGCTGTGCG ACT G AGC AAGT T ACTT AACCTCTT 
TGAGCATCTGTTTTCTCATCTTTAAAATGGAAGTAATCATAATTGACCAGGCCCAGTGGC 
TCACACCTATAATCCCAGCACCTTGGAAGGCCGAGGCCAGTGGATTGCTTGAGCCCAAGA 
GTTTGAGACCAGCATGGTGACACCTCGTCTCTAGAAAAAATACAAAAATTAGCCAGGCAT 

101290 TGACCCGGAGCAGCAAGGATAC ATTC AGAAGTGATGAAAGGGAACGCTTCTTGACAGGGT 
AAAG AGT C ATTC AGT AGG AATG AG AC AGG AAG AGGTC AC AG AGTC AG AAGCCC AGC C TGT 
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ACTCAGAGATTATTTCTGGCATGGGAGGGCCGAAGGGTTAGGAGGCCACCTACTCACAAT 
ACAATACAGAGGCAGATCCACTTATTACCTGCCTGTGCTGCTGGGATTTCAGTGTGGAAA 
TTCTGTGCCTCCTCACTGTGGCTGCAGCTTGGGAATGACATCCAGAGCTTACCCACCTGC 
(A,G] 

TAAGAAATAAGCTATAGGTGTAATAGGGGGACATAGGCTAAAATCCTAGCTCAGCTGCTT 
AATAGCTGTGCGACTGAGCAAGTTACTTAACCTCTTTGAGCATCTGTTTTCTCATCTTTA 
AAATGGAAGTAATCATAATTGACCAGGCCCAGTGGCTCACACCTATAATCCCAGCACCTT 
GGAAGGCCGAGGCCAGTGGATTGCTTGAGCCCAAGAGTTTGAGACCAGCATGGTGACACC 
TCGTCTCTAGAAAAAATACAAAAATTAGCCAGGCATGGTGGCAGGTGCCTGTAGTCTTAG 



101326 A A AGGG AACGCTTCTTGACAGGGTAAAGAGTC ATTC AGTAGG AATGAGAC AGGAAGAGGT 



CACAGAGTCAGAAGCCCAGCCTGTACTCAGAGATTATTTCTGGCATGGGAGGGCCGAAGG 
GTTAGGAGGCCACCTACTCACAATACAATACAGAGGCAGATCCACTTATTACCTGCCTGT 
GCTGCTGGGATTTCAGTGTGGAAATTCTGTGCCTCCTCACTGTGGCTGCAGCTTGGGAAT 
GACATCCAGAGCTTACCCACCTGCATAAGAAATAAGCTATAGGTGTAATAGGGGGACATA 
[G, A] 

GCTAAAATCCTAGCTCAGCTGCTTAATAGCTGTGCGACTGAGCAAGTTACTTAACCTCTT 
TGAGCATCTGTTTTCTCATCTTTAAAATGGAAGTAATCATAATTGACCAGGCCCAGTGGC 
TCACACCTATAATCCCAGCACCTTGGAAGGCCGAGGCCAGTGGATTGCTTGAGCCCAAGA 
GTTTGAGACCAGCATGGTGACACCTCGTCTCTAGAAAAAATACAAAAATTAGCCAGGCAT 
GGTGGCAGGTGCCTGTAGTCTTAGCTACTCGGTAGGCTGAGGTGGGAAGATTATATGAGC 



102342 ACCCTGTCTCAATAAATAAATAAGAAGAATGAAACAAGAAAGTTCTTCTTATGGTTCTCA 



TGGTGGTGAGCACAATGTAAGCATATATATTATCTTAGAATTCTTCCTTCCTGTATAAAG 
AAGGCCTCCTCCAATGTATTAATCATCTGTTCAACTAATAAATGCTGCTTACTCCCACTT 
TCACTCTAAAGGAACTCAATGGCTAAAGAGAACCCTTCCCCTTTGCAGCACCCTGAGGAT 
CAGAGGCCTGATTTGAATGTCCTCGATGCAAAGGACTATTTCAAAAGGCCAGCCAGGCAG 
(C,A] - • 

CCAGACATGTATTTCCTAATCGTCTCCAGGTTGTTTGATAGAAGATCTCCTGGGAGCAGG 
TTTCCGCAGCAGCTCAGCCAGGTCTGTTCTGGGAACGCTGTGTGCATTGGCACCTCCCTT 
GGCAGAAAGCTTGGAGGAAAGGCAGGTGCAGGTCCTGGAGCCTCTGACAGCATTACTGGC 
TCTAGGAGTAGCTGCTCAGGATAATCTGTCCCCATGACCATTAAGTAACTGCCACTGTGC 
GGGAAGAAGAACTGGAAATGGGGGGCCCAAAAAAATCTGAAAACCCTCACTTGAACCAGT 



104 489 GTTC AAGAGCTGGAAGGGATTTTTCTAGCCTCCAGGCAAGGTAAT ACCATAAGTCCCAAC 



AGTGATGCCCTCCCTGGGAATGATCTCAATGGGAGAATCCTATACCCTGCCTCCTCCATT 
CATTCCTTGCTCTGATGGTGGTTCTGGCTGGCTAACCTAAGTTACTCTTGCCACTAGTTA 
ACGCCTGTCCTTATTTCTCTTGTCCCCACCTAAGATGTCAATCAAAACAGCACGAGCCAT 
G C T ATGTCAC ATG AC ATGTTGT C TGTCC AGCCCAGAGCTTGT TGCT GATGGGGGCAC AGA 
(C,T) 

TAGATTTTGAGAGAAATCTCTCTGTTACCACCCTTAACATTCCAACCCCCTCTAATAGCC 
CATTTAGGATTTATCATACTGTTTCATCCAAACCTTTCATGACCTGATTTCTATTTCCAG 
CTTCAACCACCCCTTGGGTCACCACCTGTACTTATTGAGTTTCCCTAGTTTTCTGAATTA 
ATGACTGAAGATGATAAGCTTCCCTTACATATGACTCTCAAACCACCAAACTGGGATTGT 
TGTTACTCTTAGTGATAATGGTTGCTATTTATGAAACTTTTAATAGGGAACACAAACCCT 



105266 AGGCCAGAGCATCATGGCCTTTCACAAGTTGAAGAGCCACGGGCTTTCTACGGTAGCCAG 



CCACGCTTTTCCATGACTGGGGTGGGTGTGGCAAGTGATGAGGGTTTGGAGTTCATGTGG 
TGGGGTGGCAGGGACCAGGTGTCTTGGTAACTGCTGTTGCATTCACTTCAGGAGCAAAGG 
ACCAGATCTGATTCTGCAGGATCAACAATATGGACACTGCAGGCTCTGTAGACATCCAAA 
GCTCTAATGGTGACTTGGGGAAGCTCAGGAGGGCAGGGAGGTTGTACCCATTTAGAATGT 
(A, G] 

AAGATTCCTATTTTATAAAAAAGAAAAAAAGGAGACTGAAGGCCTCAGTCTCCTCCAACA 
AAGCCAGGCTGTGGGGTAGCAGAGTCTCAAAGGGTGCAGGCCCATGGCCACTGCCCAGGG 
CTCCTGCTCAGGCCTCCTCACTCCCACAACTGAGGGGAGACCCAGTTCCACACCCACCCA 
CCTAGCAGTGTCTCACACCCACCGGGAGAGGTCTAAACATCTTCCCTGGGAAATGGTCCC 
AAAATGTCCCTGCAGTAAGCAACCATCTGGAGAGGCCCAGGTCTACATCTGTTTTTAAAG 



105338 ATGACTGGGGTGGGTGTGGCAAGTGATGAGGGTTTGGAGTTC ATGTGGTGGGGTGGCAGG 



GACCAGGTGTCTTGGTAACTGCTGTTGCATTCACTTCAGGAGCAAAGGACCAGATCTGAT 
TCTGCAGGATCAACAATATGGACACTGCAGGCTCTGTAGACATCCAAAGCTCTAATGGTG 
ACTTGGGGAAGCTCAGGAGGGCAGGGAGGTTGTACCCATTTAGAATGTAAAGATTCCTAT 
TTTATAAAAAAGAAAAAAAGGAGACTGAAGGCCTCAGTCTCCTCCAACAAAGCCAGGCTG 
[T,C] 

GGGGTAGCAGAGTCTCAAAGGGTGCAGGCCCATGGCCACTGCCCAGGGCTCCTGCTCAGG 
CCTCCTCACTCCCACAACTGAGGGGAGACCCAGTTCCACACCCACCCACCTAGCAGTGTC 
TCACACCCACCGGGAGAGGTCTAAACATCTTCCCTGGGAAATGGTCCCAAAATGTCCCTG 
CAGTAAGCAACCATCTGGAGAGGCCCAGGTCTACATCTGTTTTTAAAGCTCCAATAAATA 
AAT AAAT G AAGG AAG AAAAAAAG AAG AAG AAATGC AG AAC AGGGTG AC T AAAAT TGGC AT 



105570 ATTCCTATTTTATAAAAAAGAAAAAAAGGAGACTGAAGGCCTCAGTCTCCTCCAACAAAG 



CCAGGCTGTGGGGTAGCAGAGTCTCAAAGGGTGCAGGCCCATGGCCACTGCCCAGGGCTC 
CTGCTCAGGCCTCCTCACTCCCACAACTGAGGGGAGACCCAGTTCCACACCCACCCACCT 
AGCAGTGTCTCACACCCACCGGGAGAGGTCTAAACATCTTCCCTGGGAAATGGTCCCAAA 
ATGTCCCTGCAGTAAGCAACCATCTGGAGAGGCCCAGGTCTACATCTGTTTTTAAAGCTC 
[C,A] 

AATAAATAAATAAATGAAGGAAGAAAAAAAGAAGAAGAAATGCAGAACAGGGTGACTAAA 
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ATTGGCATGTATTTTTAAATGTTTATATTAACAAACTAACACCTTTTAACATGAAAAGCA 
ATATAATTGTGCTAGCCACAAAATCATCGTAGGACTGAGAAAGGAATCGTGATTCTGAGA 
GCCCTAGAGTTAATGTGATCCAGCTGGCTCATCCCTGTGACTGCAGAAGCCTGTTTGGAG 
ATAGTGTCAGTAGCTTTTCAGGCCCTCTGTGAATTGCCAGAATGTGTGACATGAGCCAAA 



105928 AAAATTGGC ATGTATTTTT AAATGTTT AT ATT AACAAACT AACACCTTTTAAC ATGAAAA 



GCAATATAATTGTGCTAGCCACAAAATCATCGTAGGACTGAGAAAGGAATCGTGATTCTG 

AGAGCCCTAGAGTTAATGTGATCCAGCTGGCTCATCCCTGTGACTGCAGAAGCCTGTTTG 

GAGATAGTGTCAGTAGCTTTTCAGGCCCTCTGTGAATTGCCAGAATGTGTGACATGAGCC 

AAATTTCCCCCCAGCATCCCCGCCGCCGCCACCACCACCCCCGACCCAACCCTCCCGCCG 
[G,AJ 

CTCCCATAGAATAGTCACTGCCATACAGAAAAAGAGAAGTTCTACTATTTCTGGGCAAGA 
TTTCCACAAACCAGTTTGTCCCTTTCTGCTTTCATGAAATAAACCATTTGGATCAACGTC 
AGCTGATTGCAAAAATTTTCCCTTGTCTCAAAAGCAAGACTGATAAGGAAGCAAACATGG 
GAGGACCTTAGTGGCCGAGCCTTTATGTGTATGTTATTTCATTGCTCTCATAACTGCCCT 
GGGATGCTGTAAGCATGATTCATCCTGTTTGTTTATCAGTTAAATTATGTATCCAAGATT 



206-459 TAACTGCCCTGGGATGCTGTAAGCATGATTCATCCTGTTTGTTTATCAGTTAAATTATGT 



ATCCAAGATTACACAGCCTATCCAGGATTAGAACTCAGAGCCCTCGGCTGTGAAGCTTGA 
GCTCTTTCTTTTCAGTCTTCAAATATGATCATGCCATGAAGCAGCACAAAGCCCAGGAGG 
AGCCCAGTGAGGCTGGAGGGGTCCACTGGCAGCCACTCTCCTCCGTGCCCCTGTGGTGTT 

GGOGCAAACTTGGATCTTTCTGAATCTTTTAACTGTTTCCTTCTCTTCCCGTTTTTGTCT 
[G,C] 

CTGGCTGACTTGTCCTACACTCTACTCCTTGCTTATGATACTTATTTTTCCATCCACAGC 
AAAACAATTCACATCAAGGTAATTGATGATGAGGCATATGAGAAAAACAAGAATTACTTC 
ATTGAGATGATGGGCCCCCGCATGGTGGATATGAGTTTTCAGAAAGGTGTAGTACCCTGT 
CCTCCACACTAACACTAACATTCTTCTCTCCTCTTCTGTTTCTTCCTCTCCAACCCATTT 
GTCTCCTCCTCCTCTTGTCTTCCACCTCTCTGGTTCCCTTTCCCTTGTCTCCTCTCTTGC 



107710 CTTC AATGACCCCATACATCCCATGGCCTCCAATAGACAAGTCAAGAAGTCCTTTCCTGA 



ATAGATCATACTGTGGAGCAGGGAGCTGCCAGTACTGAGGGCAATGTTCCTTCCCCTTCC 
AAGCTGTCCCTCATGCCCTCCAGTACATGCCTGTTGTCACAGAGCACCCCAATCCCATCC 
CACAGCAGAGTTCCTGCAGCAGAGAAACAGGCTCACACCTTGTAGACAGCCCTGGGGTCC 
CATATCTAGGGCCAACAGAAATATTCCCAAAAAAATGCCTCTTGACAATCAATGAGCTTT 
[C,G} 

TCTTTTGTCCGCTGAGCAAGGTATAAAAAGATGTCAAAAGAAGTACCCAAAAAGGTAATA 
AAAATGTACAGTCGTGCATCACTTAGCAATAAGGATACATTCTGAGGAAGGTGTCCTTAA 
GCAATTTTGTCATCGTGGGAAAATTATAGAGTGTACTTTCACAAACCTAGATGGTGTAGC 
CTACAACACACCTGGACTATGTGGGCCTATTGCTCCTAGGCTACAAACCTGTACAGCATG 
TGCTTGTACTGAATATTGCAGGCAACTGTAGCACAATGGTATTTGTGTATCTAAACACAT 



108062 AAGGT A AT AAA AATGT AC AGTCGTGC ATCACTT AGC AAT AAGGAT AC ATTCTG AGG AAGG 



TGTCCTTAAGCAATTTTGTCATCGTGGGAAAATTATAGAGTGTACTTTCACAAACCTAGA 
TGGTGTAGCCTACAACACACCTGGACTATGTGGGCCTATTGCTCCTAGGCTACAAACCTG 
TACAGCATGTGCTTGTACTGAATATTGCAGGCAACTGTAGCACAATGGTATTTGTGTATC 
TAAACACATCTAGACATAGAAAAGGCACAGTAAAAATATCGTAGTATATAGCCTTATGGG 
(G,A] 

CCACTATTGTAGATGTGGTCTGTCATTGAGCAAAACGTTTTTATGTAGCATGTGACTGTA 
CTTGTAAAGTACACACACCACAAATGCACAGCAAGTCCTGTGCCCTACAAGCCCCTTTGG 
GTCAGTCTACTACATTATAAATGGCAAAGCCGAGCACGCCCACAGAAGGTAGCAGGAACA 
TC AG AGG ATC T G AAG AG AC AT TT AGGT AAATGCTCTTT ACCCTTT AG AGC ATT T AGTTC T 
TAGGCCTCCCCTCCCCCAATCTCCCCCCCGCCCCCCGCCAAAAAGAAAAAGAAAAAGAAA 



108214 GGCCT ATTGCTCCT AGGCTAC AAACCTGT AC AGC ATGTGCTTGT ACTG AAT ATTGC AGGC 



AACTGTAGCACAATGGTATTTGTGTATCTAAACACATCTAGACATAGAAAAGGCACAGTA 

AAAATATCGTAGTATATAGCCTTATGGGACCACTATTGTAGATGTGGTCTGTCATTGAGC 

AAAACGTTTTTATGTAGCATGTGACTGTACTTGTAAAGTACACACACCACAAATGCACAG 

CAAGTCCTGTGCCCTACAAGCCCCTTTGGGTCAGTCTACTACATTATAAATGGCAAAGCC 
[G,A] 

AGCACGCCCACAGAAGGTAGCAGGAACATCAGAGGATCTGAAGAGACATTTAGGTAAATG 
CTCTTTACCCTTTAGAGCATTTAGTTCTTAGGCCTCCCCTCCCCCAATCTCCCCCCCGCC 
CCCCGCCAAAAAGAAAAAGAAAAAGAAAGCAGAAAATTACAATTCTGGCTCACTAGTAGG 
ACCTGCTAGCCACCATTGTGATTCCATGAAGGACCAGAAGAAACCATATAGGAAGAATCA 
GGCCCACACGGCAACCTCTCCACATGACAAAGAGCCAGTCTTTGGAGGGCAGTGAATTTC 



108364 CACTATTGTAGATGTGGTCTGTCATTGAGCAAAACGTTTTTATGTAGCATGTGACTGTAC 



TTGTAAAGTACACACACCACAAATGCACAGCAAGTCCTGTGCCCTACAAGCCCCTTTGGG 
TCAGTCTACTACATTATAAATGGCAAAGCCGAGCACGCCCACAGAAGGTAGCAGGAACAT 
CAGAGGATCTGAAGAGACATTTAGGTAAATGCTCTTTACCCTTTAGAGCATTTAGTTCTT 
AGGCCTCCCCTCCCCCAATCTCCCCCCCGCCCCCCGCCAAAAAGAAAAAGAAAAAGAAAG 
(C,AJ 

AGAAAATTACAATTCTGGCTCACTAGTAGGACCTGCTAGCCACCATTGTGATTCCATGAA 
GGACCAGAAGAAACCATATAGGAAGAATCAGGCCCACACGGCAACCTCTCCACATGACAA 
AGAGCCAGTCTTTGGAGGGCAGTGAATTTCAAGGAAAGTTTTCTTCCCTGGGTGACTTGT 
TTT T AAAAG A TGTT A TG TTTTGT TG AGAT ACCC AG AG ATG AAC AG AAACTTCC ATC ACC T 
TGTGCCCCAGACCCATGATAATTCACATTGAGGAAACCAGTTTTGGAACACATCACCCCT 
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108657 AAGAAAGCAGAAAATTACAATTCTGGCTCACTAGTAGGACCTGCTAGCCACCATTGTGAT 



TCCATGAAGGACCAGAAGAAACCATATAGGAAGAATCAGGCCCACACGGCAACCTCTCCA 
CATGACAAAGAGCCAGTCTTTGGAGGGCAGTGAATTTCAAGGAAAGTTTTCTTCCCTGGG 
TGACTTGTTTTTAAAAGATGTTATGTTTTGTTGAGATACCCAGAGATGAACAGAAACTTC 
CATCACCTTGTGCCCCAGACCCATGATAATTCACATTGAGGAAACCAGTTTTGGAACACA 
IT, A] 

CACCCCTAAGTGATAGAAGCCCAAAGGTGATTTAGAATTTGATGATTTACATCATTTTCT 
T C AC ATTT T C CC AG AAATGC AT C AGCTGT AAAT AGT AAAGG ATTC C T ATGT AAT ATTGTG 
GTTAATACATATTTATTTTAGTTCCCACCACTGAAGCCCTATGAGATAAAGAATGAGAAA 
GATCACACAATTCTACCTCCCTTTCTTCTCTCTCTCTCTCTCTTTCTCTTTCTCTCTCAC 
TCTCTCTCTCTCTCTCTCTTCTCTTCCTCTGTCTGGTTTTCCTTCCTCATAAATACTTTT 



109746 GCTCC AGGTGACC AACCAATCTACCGACCCAGTCGACACACTCTCTCTCTTGTTGTCCCT 



ACAGGAAAACCATAAGGGTTAAAATAGTAGATGAGGAGGAATACGAAAGGCAAGAGAATT 
TCTTCATTGCCCTTGGTGAACCGAAATGGATGGAACGTGGAATATCAGGTGTGAGATTCT 
TTAAAAACAAAACAACAAAAAAAAAGAAAGAAAAATTAAAACAAACTGAAAAACAACAAC 
AAAAAAGAAAAAGCAGCTATATTTTTGTCTCCCTCCTTTTCTTCCCTTCTCCTCCTTTCT 
ICT) 

TTTTTGACCAATGGATTTTTTTATTCTTTTCCCTCCTGTATTCTCGCTCTCACCCTGTTT 
CGGTATCATCTCTGCCTTCTTAGCCTTAGCTTATTCCAAATTCCTCCTTTACCGCCTTCT 
GGGCAGCACTGCAGCCTCAACTCCTCATTACCCTAATGAGTTATTTCCCTGTTTTGCTAC 
AATTTTCAATTATTCAATTGCCATGGGCCCCTGCACTCTCCCCCACCCCACCCCTACACT 
GTAACCTGTAAATGTGAAAATTCCTTGGTGGGTGGGGAGGAGAAGAAAAAAAAGGAATGT 



111484 AC AGGCCTTCTCAGTGTGATTGGTCATTTCTCATTGTCTGCTGGGGACTCTCCTGCAGAG 



CTGACCACTTCTGTGCCTGCGCTGGTTTGGACACACCTGATGCTCTAGGGGCAGAACTCC 
TCTCCTTCTTCACTGCTGGTTCTCTTCGTCACCACTCAATAAAACGTTGCCCTCAGCCTG 
ACTGCCAAAAAGTGCTGGAAGAAAGAAATTATCTCTGGTTCTATTGTTTCCCACATTGTA 
TTCTTGCCCAACTTCCAGTTCTTGCCACCAACAATATTCTCAGAGGTTGCCTCAGCACCT 
[G,T] 

CCCTACCTCATTCCCACCTCCCTTGAGCATTTATTCCATGTATTCATAATTGGTTGGAAG 
CAGCAGATACCCAAGGCCAATTGTAAGTCACCTTCATCAGTTTCCACAGTCCAAGCTACT 
T AG ATG C AAAC G A AAGC AG C AC ATGT AC AGCGT AC AGG AAGG AAGGC AGTGGT TCC AG AC 
AAGAGGAAGAGATTGGAAGTCCATACATGCCTTTATTCCACCAGTAAAAAGGCTCTTCTC 
TTATGCCTCCCTTAAAACCTCTACCAACAGCAGGACAGAGAGTGACCCAAGATAAGTCTT 



112879 TTGGTCCTCCTTTTTCGACAACCGTGGGCTCATCTTGACAAGCTGCCCAGATGCTTCCTA 



ATTACTCACAGTCCTATGCTCTTTCCAGCTTGTCCCTGGGGTGTCTGAGCAGGAATAAAT 
GACTCTCACCTGACCCAGGGGATCAATACAGGGGAAAGTTCAGCTCCAGCTTCTCTCATG 
AGCAGCAGCAGGAAAAACACCCTCGAGGTATTGTGTCAGTCAAAGCTGGCCTACCCAGGT 
CTTGCTGACCCATCTATAACTGCTGAGCAGAAAGTCTTGGATTCATGGAGACAATGACCA 
[A,G] 

AGAATGATGGAATTCCAGCCAACTGCAGGCCTTCTCACTACTCTAGGGATGGGCCAGATG 
TTCGGTGGCATGTATGAGTGAAAACCAGGGCATCAGGGACCTTTCTGGAAGAGCTGCCTT 
TGTCTGACCCACCTGTGTTCATTTATGTGCTGGGATCTCTGATCTCCCCTGGAACTTGGG 
GGAAGCTCTTCCACGCAAACTCCCGGAAGGAGCAGAATAAACAAGCTCTTGCCTATCTAT 
CTATCTATCTATCTATCTATCTATCTATCTATCTATCTACCTATCTGCCTATCTATATCT 



113245 TGGCATGTATGAGTGAAAACC AGGGCATCAGGGACCTTTCTGGAAGAGCTGCCTTTGTCT 



GACCCACCTGTGTTCATTTATGTGCTGGGATCTCTGATCTCCCCTGGAACTTGGGGGAAG 
CTCTTCCACGCAAACTCCCGGAAGGAGCAGAATAAACAAGCTCTTGCCTATCTATCTATC 
TATCTATCTATCTATCTATCTATCTATCTATCTACCTATCTGCCTATCTATATCTATCTA 
TCTCAATGTAGTGAGGAAAGCCATTGATCCATTAACCTTTGGAATTCTACATGGGAGATA 
[C,T] 

CTAAAAAAGTGAACTGCCTTGTTTATGTATCATGCAGACTCTGGATCCACATATATCTCA 
GTGGCTGTGAATATAGGATGATTGATCACAGGCCTGAGTTGCATTCCTACAGATTCTTAG 
GAAAAAAATTGATTCACAGACATGTCCCCCCTGGTTCCCCCACAACACACACTCCTTCCT 
CAGCAATCTCTATCAGTCACCAACTACACGTTGAATATGTGGCAAGCTCTTCCCAGACCT 
T T ATCT GAG A GC C AAGG AGTG AGGGGCTGT ACT AAG AT ATC AT AG AAATG AAAATGTGG T 



113265 CAGGGC ATC AGGG ACCTTTCTGGAAGAGCTGCCTTTGTCTGACCCACCTGTGTTCATTTA 



TGTGCTGGGATCTCTGATCTCCCCTGGAACTTGGGGGAAGCTCTTCCACGCAAACTCCCG 
GAAGGAGCAGAATAAACAAGCTCTTGCCTATCTATCTATCTATCTATCTATCTATCTATC 
T ATCT ATCT ATCT AC CTATCTGCCT ATCT AT ATCT ATCT ATCTCAATGTAGTGAGG7^AAG 
CCATTGATCCATTAACCTTTGGAATTCTACATGGGAGATACCTAAAAAAGTGAACTGCCT 
[T,C] 

GTTTATGTATCATGCAGACTCTGGATCCACATATATCTCAGTGGCTGTGAATATAGGATG 
ATTG AT C AC AGGCC T G AGT TGC ATTCCT AC AGAT TC TT AGG AAAAAAATTG ATTC AC AG A 
CATGTCCCCCCTGGTTCCCCCACAACACACACTCCTTCCTCAGCAATCTCTATCAGTCAC 
CAACTACACGTTGAATATGTGGCAAGCTCTTCCCAGACCTTTATCTGAGAGCCAAGGAGT 
GAGGGGCTGTACTAAGATATCATAGAAATGAAAATGTGGTGTGTCACAAGTTTCCTTAAT 



1134 97 G AGG AAAGC C ATT G ATCC ATT AAC CTT TGG AATT C T AC ATGGGAG AT ACCT AAAAAAGTG 



AACTGCCTTGTTTATGTATCATGCAGACTCTGGATCCACATATATCTCAGTGGCTGTGAA 
TATAGGATGATTGATCACAGGCCTGAGTTGCATTCCTACAGATTCTTAGGAAAAAAATTG 
ATTCACAGACATGTCCCCCCTGGTTCCCCCACAACACACACTCCTTCCTCAGCAATCTCT 
ATCAGTCACCAACTACACGTTGAATATGTGGCAAGCTCTTCCCAGACCTTTATCTGAGAG 
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[C,Gj 

CAAGGAGTGAGGGGCTGTACTAAGATATCATAGAAATGAAAATGTGGTGTGTCACAAGTT 
TCCTT AATTCTT AG ATCTT AAACTCT AAG AGGGTT C AGC AT AAGT AC AAAT TC AAGGGCT 
AGAGACAACCTGTATTGGGTGTGTCTTTAACTCAGTTTCCCAATCCACATAGGGACCTTG 
C ATTT GTC ATCTCTC AT CT ATGT AT AGCTGTTGGT ATG AC AGTTTC T CTGT TCC AG AAT A 
CCTGAACTCTGACTTAGCCTGTCCTTTCTGAAACAGAAAAATCACCCAACCAGAGATCTA 

114 4 86 CCCCATGGTCATTTTTGCCACTCATAAGTTAGCTACTCTGGCAGGGTTGCAACTT ACACA 
GTTTTCATGATAACTGGATTCTCACTCCTTTTTTTACAGAATGGATGTGATAACCTGGTA 
TCCTACACAGTCATGAGTGACCAACCTACCCATTTGGTTCCCCATCCTCATTCCTCCATT 
CCTAGCCCTAGGGTAGCCGGGAAAGCATAGGAGCAAATGCCCTTACCAGGGCCCTGGTGC 
TCAGCAGCCTCTCCGGCTGCTCACACCTCTTGCTGCTGCTCTGTGCATGCTCCAAAGGCT 
[G, T) 

CTTTTTGCGTATGGCTGCTGAGCTCTCACCTACTAAGCTCTCTGCTTTCCTTATGCTGCC 
AGCAACCACAAAACCTGGTGATACTTTCAAGATGGGACATTAATGCTCTTTCCTTTTCTT 
TC TTCC ATTTTTC TGGT AT CC AT TT GC AAAC AGCGCTCCTGTT ATCTCC AGGT AAG AGGT 
GTCTTGTCCCCCTCTTTTCTTTCCACTTCTTGCCAGTGCCATTATTTGGTTTAAGACCAA 
TGTCCTTTGATTTATTGAATAAGAACTGCAGGCTCAAGTTAACCTGACAATTTCTCCCAA 

114686 GAAAGCATAGGAGCAAATGCCCTTACCAGGGCCCTGGTGCTCAGCAGCCTCTCCGGCTGC 
TCACACCTCTTGCTGCTGCTCTGTGCATGCTCCAAAGGCTGCTTTTTGCGTATGGCTGCT 
GAGCTCTCACCTACTAAGCTCTCTGCTTTCCTTATGCTGCCAGCAACCACAAAACCTGGT 
GATACTTTCAAGATGGGACATTAATGCTCTTTCCTTTTCTTTCTTCCATTTTTCTGGTAT 
CC ATT TGC AAAC AGCGCTCCTGTTATC TCC AGGT AAG AGGTGTCTTGTCCCCCTCTTTTC 
[T,C] 

TTCCACTTCTTGCCAGTGCCATTATTTGGTTTAAGACCAATGTCCTTTGATTTATTGAAT 
AAGAACTGCAGGCTCAAGTTAACCTGACAATTTCTCCCAAGGACTGGGAGATTTATTTTC 
CCACATGAAGCAATTATGAGAAAGCAATTGTGAGGAAGGCAATTCCTTGAGCATCACTTC 
TGTCTGGGGACGTGGGTTAAGGCATAGCTGATCCTCTCTGGGACCAGGAAGAGAAATTAA 
GCTTAACAAGGAGATGGTGGGTCATAGACTTCTCCTGAGTCTTAATTCATCTGCCATCTC 

114817 TACTAAGCTCTCTGCTTTCCTTATGCTGCCAGCAACC ACAAAACCTGGTGATACTTTCAA 

GATGGGACATTAATGCTCTTTCCTTTTCTTTCTTCCATTTTTCTGGTATCCATTTGCAAA 
C AGCGCTCCTGTT ATCTCCAGGTAAGAGGTGTCTTGTCCCCCTCTTTTCTTTCCACTTCT 
TGCCAGTGCCATTATTTGGTTTAAGACCAATGTCCTTTGATTTATTGAATAAGAACTGCA 
GGCTCAAGTTAACCTGACAATTTCTCCCAAGGACTGGGAGATTTATTTTCCCACATGAAG 
[C,A] 

AATTATGAGAAAGCAATTGTGAGGAAGGCAATTCCTTGAGCATCACTTCTGTCTGGGGAC 
GTGGGTTAAGGCATAGCTGATCCTCTCTGGGACCAGGAAGAGAAATTAAGCTTAACAAGG 
AGATGGTGGGTCATAGACTTCTCCTGAGTCTTAATTCATCTGCCATCTCATGTTGTGGGG 
GAAGAGACAGTGAGATTCAGAGCTGGAATCTCCTAATATAATTGTGACAGGATTTGAAAA 
AAAAATACTTTAATCCCAAGGGATCCAGGAAATAACCAAACCTGTTGTGAGAATAGGAAA 

115600 AGAGAATTTATTTTGAAGAGATTCTCATGCAGAATCTAGGTGCTATAGAGGACGTACACC 
TACTTTGAGAGTATGCTTGCATGAGTGGAAACCAATCATAAACAACATTCAACTTCATGA 
GCAGATATGAAAGCATTTTCAGCATATCTAGCAATACTATAACTCTTTGTGCAAGCAGAG 
TGGCCTACACAAGACAGTTTCAATATATTTTAAAAGAACGTCTTACATTTCATCAGTCCT 
T T G AAC AC AG AAAA AAATGTTAAGGCC AC T T AAG AGGC AAAAC AT C T T AC AG AGTTC ATT 
[G,T] 

ATATTCAAAGTCACCTACAGGCTACATCTTGGGTTCAGGAAGGGGCGGTGTACATAGTAA 
GGACATACGCCTTCTGGGAGCCTTAAACAAACAAAAAAAATGTAGGTAACTCCTACATTT 
TTCTTTTGTGGAAAAAACACAGTTACTCCAGCTTCCTTGGCTTTTTGCTTCTTTTTTATA 
CCAACAAAAT AAGGGCT ATCCTCAACCCTCTGTTCTTCATTCTTCTCCCAGGGTATTGAT 
TTCATAACATTGGGTTTTTCTTCTCTACTTCACTCATCCTCTTGCCTGTGAAGGTATGTA 

115668 G AGT AT GC TT G C ATG AGTGG A AAC C AAT C AT AAAC AAC ATTC AACTTC ATG AGC AG AT AT 
GAAAGCATTTTCAGCATATCTAGCAATACTATAACTCTTTGTGCAAGCAGAGTGGCCTAC 
ACAAGACAGTTTCAATATATTTTAAAAGAACGTCTTACATTTCATCAGTCCTTTGAACAC 
AG AAAA AAAT GTT AAGGCC ACTT AAG AGGC AAAAC ATCTT AC AG AGTTCATTG AT ATTC A 
AAGTCACCTACAGGCTACATCTTGGGTTCAGGAAGGGGCGGTGTACATAGTAAGGACATA 
[A,C] 

GCCTTCTGGGAGCCTTAAACAAACAAAAAAAATGTAGGTAACTCCTACATTTTTCTTTTG 
TGGAAAAAACACAGTTACTCCAGCTTCCTTGGCTTTTTGCTTCTTTTTTATACCAACAAA 
ATAAGGGCTATCCTCAACCCTCTGTTCTTCATTCTTCTCCCAGGGTATTGATTTCATAAC 
ATTGGGTTTTTCTTCTCTACTTCACTCATCCTCTTGCCTGTGAAGGTATGTAAGGCTTCT 
TTGTTCCAACTCTTTCCTCCACCCGCCCCCCCTCACATAAATGCATAACAAAGATTGTGA 

115745 ATCTAGCAATACT ATAACTCTTTGTGCAAGCAGAGTGGCCTACACAAGACAGTTTCAATA 
TATTTTAAAAGAACGTCTTACATTTCATCAGTCCTTTGAACACAGAAAAAAATGTTAAGG 
CC ACTT AAG AGGC AAAAC ATCTT AC AG AGTTCATTG AT ATTC AAAGTCACCT AC AGGC T A 
CATCTTGGGTTCAGGAAGGGGCGGTGTACATAGTAAGGACATACGCCTTCTGGGAGCCTT 
AAACAAACAAAAAAAATGTAGGTAACTCCTACATTTTTCTTTTGTGGAAAAAACACAGTT 
(A, G) 

CTCCAGCTTCCTTGGCTTTTTGCTTCTTTTTTATACCAACAAAATAAGGGCTATCCTCAA 
CCCTCTGTTCTTCATTCTTCTCCCAGGGTATTGATTTCATAACATTGGGTTTTTCTTCTC 
TACTTCACTCATCCTCTTGCCTGTGAAGGTATGTAAGGCTTCTTTGTTCCAACTCTTTCC 
TCCACCCGCCCCCCCTCACATAAATGCATAACAAAGATTGTGATTTAATTTAAGTTTCTT 
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120830 



121926 



122102 



122950 



TCTACTTTTAACATATTTGCAAACATCAATAGAAGCTAAAATGGGAAAAAGGAAATGTTT 

AA T AATAC TG TCGCTGCT AAG AT AGGC ATTGTG AT A TGGTGC T T A AACC TGC AAGT AAAG 
GAAAAGAGTATGGAATCTGTGTGTCTTTTTCTAAGGGCTTTTTCCCAGAGTAGCTTGCAG 
TCTGGCTTCTAGGGTTGCTGGCCTATAGCCAGAACCCTAGATTCACCCAGATTTACCTTC 
AGAATTAACTAATCAGAGACTCAAATTCAATAGACTAAATGAAGTCAGGCTGCTAGAGGA 
TGTCTGCTGACTTGGACATATGCAGAAAGACATGGATCCTTGAGAAAACATTGTTTCCAA 
[A,C] 

AGTGGCCACCAGCACTAGAGGAAGGACAGCACCACGGACAGCTCCCAGACATTTTAGGAT 
TGCCTTCTGTGTTTGGTGCCCGAACACTGAGCAAAACAGCGAACTCAGGAAGTCTCCACA 
CACTCTCATACCATCTTCATGCAGTCCAACTAAGAAAATTCTTACATAAAATATAAGGCT 
GTCTGCTTGGTAATTTAAACCCTTGGCTTATAGTCTTTTCAGTGAATTTCTTTCCTTGCA 
AAC TC G AG AGTTGG AGTCTC ACG ACTGCCCTTGC TTC AC C AATTCCC C AGCT AGAG AC AA 

CCCATTTGTAACATGAAACAAATAGTGCTGACCATTTGTATGCTAGGAATATTGTTAGGA 
AACATAATATAGAATGTGAAATAAGTGGACTAGAAAGTCCTGAGATGTATTATCATTATT 
GTTTAACTGTGTTTTTAAAGCAAAAATATTAAAACTCACTACTACAGGGCAAGATATATT 
AACATCATTATTATTATTCATTATTGTATTATTCTAAATAGCCAATTTCAAAAGTCACAA 
CCAGGCCAGGCAGTGAGGGACTCACGCCTGTAATCTCAGCACTTTGAGAGGCCGAGATGG 
(A,GJ 

AGGGTCACTTATACCTAGGAATTTGAGACCAGCCTGGGCAACATAGGGAGACTCCATCTC 
TATAAAAAATAAAACAAAATAAAAATCAGCTCAGTGTGGTTGTACATGCCTGTGGTCCCA 
GCTACTCAGGAGGCTGAGGTGGGAGGATGGCTTGAGCCCAGGAGGTTGAGGTTGCAATGA 
GCCATGATTGCACCACTGCACTCCAGCCTGGGTGACAAAGTGAGACCCTGTCTCAAACAA 
AACAAAACAAAAAGATTACAACCAAAAACAAAGGGAAATAGAAGGATTG'CCTCAAAAGAG 

CCCCAAGGCCCCAGAGCCAGAGCAAAAAGACAGCCAGGAAGAGAGGTTTGCTGCTGCTGC 
TGCTGCTGCTACCCCACTTTTCTCATCACCTGCTTTAGATCTTTCTAGCTCCCCCTCTGA 
TGACCTGACTGTGCCCCTCAAGACAATAAACGGAATGTAGGCCACATCATCTACCCTGCT 
CCTTTT AC AAAGG AGGGG AC TG AGGT TC AGAAAT AA G AG ATG AT TT AC CCC AGCTT AC AG 
ATTTTCTTCATGGCAAAGCTGGAATGAGAACCCAAGTGTTCTGACTCCTGTTCTTTCAAA 
[C,A] 

CCCAGCTTCTACCGGTTATGCCAAAACATGACAGAAGTTGCCGTTGGCAAGGCACAGGCA 
TGCCTCAGCATACCCTCCCCTCCAGGGCTGCTGAGTGGGCAACTCTGCCCACATTTCCTG 
GC AAGG AC AATC AAGGCCC ATCCTGCTTTTTCCC AT G AG ATGT T T GG AGG AGGGC ACTGG 
CTCTGCAGTATATTCTCGTGATCTGGAATGACAGCCATCCCTCAGGGGACAGATAATGAC 
CAGAACCACAATGGTTATTGCAGCAGTCAGGTCAGAAAATTTGAGAGGAGCCCTGCTGGC 

CAACTCTGCCCACATTTCCTGGCAAGGACAATCAAGGCCCATCCTGCTTTTTCCCATGAG 
ATGTTTGGAGGAGGGCACTGGCTCTGCAGTATATTCTCGTGATCTGGAATGACAGCCATC 
CCTC AGGGG AC AG AT AAT G ACC AG AACC AC AAT GGTT AT TGC AGC AGTC AGGTC AG AAAA 
TTTGAGAGGAGCCCTGCTGGCATCCAGTGAAGAGTGGCCACACCGAACTGATTTCACTTC 
TCTCCTTAGACAACAAAATGCAGCCTGTGCATTCTCCTTTCTTTTTTTTTTTAATTATAC 
(A, T J 

TTAAGTTCTGGGGTACATGTGCAGAACATAGAGTTTTGTTACATAGGTATACACGTGCCA 
TGGCGGTTTGCTGCACCCATCAACCCGTCATCTACATTAGGTATTTCTCCTAATGCTATC 
CCTCCCCTATCCCTCACCCCTGACAGGCTCCAGTGTGTGATGTTCCTCTCCCTGTGTCCA 
TGTG T TC TCATTG TTC AACTCCC ACT T ATG AG TG AG AAC ATGC AGTG TTTGG TTTTC TGT 
TCTTGTGTTAGTTTGCTGAGAATGATGGTTTGCATCCTCCTTTCTTTCTGCTCCACTGTC 

TTGGTCTCAAAGATTCAGTCACAGCTGTTGTTTTCGTGGCATTTGGCACCTCTGTCCCAG 
GTGAGAGTGAGAGGTGCTTGAATTTGCAAAGAGGATTTTACCTGGTTCAAATGACCCCTG 
GACTCCATCTCATTATCTTCCACACCATCTCAGATCTGAACTTAACAGAGCCTCTGCCCT 
TAAAGTGCACAAAAGTCAATCAAAGAGATGAATAATGACATTAGTAATGACAGCTAATAT 
TTCTTG AGC AC TTTC AATGTG AC AG AC ACC ATGTGTGTTC AGC AAT T T AC AC ATTT ACAT 
[T,C] 

TTCCCCCTGTAATGTTTCCCAAAGCCCTATTAAATAGGGTAAGTTATTATCCCCACTTCA 
CAGACAAAGAAACTGAGGCCCACAGAGGTTAAGCTACATGCCCAAGTAAGTGGTCCAATT 
TCTTAACCTCCACATTATGTGAGTAGACCACAAACAGTGAAATTAAAAGAATGTAGATAT 
TGTTCTCCTTCTATTTACCTCTGGCGATCTCTGAGAGGTTAAAGATTAGCCAGCTCAAAG 
AT ATC AAAGG AG AAATGCCC ACAT AC AT TC TTGGCCTC CTCT ACTTGGAAGGAC AC TGTG 

CCCTTAAAGTGCACAAAAGTCAATCAAAGAGATGAATAATGACATTAGTAATGACAGCTA 
ATATTTCTTGAGCACTTTCAATGTGACAGACACCATGTGTGTTCAGCAATTTACACATTT 
ACATTTTCCCCCTGTAATGTTTCCCAAAGCCCTATTAAATAGGGTAAGTTATTATCCCCA 
CTTCACAGACAAAGAAACTGAGGCCCACAGAGGTTAAGCTACATGCCCAAGTAAGTGGTC 
CAATTTCTTAACCTCCACATTATGTGAGTAGACCACAAACAGTGAAATTAAAAGAATGTA 
IG,C) 

ATATTGTTCTCCTTCTATTTACCTCTGGCGATCTCTGAGAGGTTAAAGATTAGCCAGCTC 
AAAGATATCAAAGGAGAAATGCCCACATACATTCTTGGCCTCCTCTACTTGGAAGGACAC 
TGTGAGTACAAAGTATCTCCTAGCAGGACAGCCAAAGGAAGTTCCACAGCTTTTATCTTT 
TTATAGGATGAATTACATACTCTTTCTTTTTCTTAGGAACACTCAGAGACAAACAGAAAG 
G AGCGGAC ATTC CTTT ACTC ATTG AAC AAAT ATTT AC TG AGC AC CT ATT AT GCCTGTT AC 

GGGGCCTCTGAACTCTGAACTTCTGCCAGGGAGCTGGCATCCCAGTTGCCCCAGAAAGAA 
ACAGAGCACATCCTCCTGCAGGGAAGTTAGGCTGAATCTCATCAGACAGGACTTTTCTGG 
CTGGGCCAAGGGAAATCTTTCCTGTACCAAGCAAACATATCCTTCAAGAGAGTAGCTGAA 




3OCI0: <WO 0233086A2_I_> 



WO 02/33086 



PCT/US01/32152 



TTCACATCAAATTCTAGGAAAACCTCTTTCCAAAACCCCAGCGCAGGCCAGCGGTATTAT 

TTGTCCATTAGTGATGCAAGAGATTTAGCTATCGTGGAAATGCATCAGAAGGTTGGAAAT 
(T,C) 

AGATGGATGATCCCAGGAAGGCCTGTGGATGAGATGCCCTGTGATCTCTGTTCTCCAAGC 
CTTGGGGGACCTGAACTATCAGAGGGGAGGGAGGAAATATGGGGGAAAGCATAGAGGTGG 
GAAGAAATATCAGAGGATCAGAAGCAAAAAACAACAATAACAACAGAAACAAAAACAAAC 
AAACAAACAAAAAAACAAGGCCATAGGCAAGAAAGGGTAAGAGGTTTTCTCTGGGAGATC 
TAAAAAAAATGGCAATAATGAGGTAAGCCAGGCAGATACCTTTGGGCATCTCCAAGTCCT 

GGTGGGAAGAAATATCAGAGGATCAGAAGCAAAAAACAACAATAACAACAGAAACAAAAA 

CAAACAAACAAACAAAAAAACAAGGCCATAGGCAAGAAAGGGTAAGAGGTTTTCTCTGGG 

AGATCTAAAAAAAATGGCAATAATGAGGTAAGCCAGGCAGATACCTTTGGGCATCTCCAA 

GTCCTTGCAATTGGCCAAGACAACAGCTAACAACATTTGAGGCTTTAAGAAGGTTACCCT 

GTGATCCACTCATCTGATTTAGTGGCTTTGGCTGAAGCTCTTTGGATATAGTTGAAGGTA 
(C,TJ 

GGAAAGGGTCCTTACATGAGGACTTTAGGGTCAAGTCTCTTGCTAACATCCTATGTGACC 
TTGGGTAAATTCTTTGACCCTTATTTTTCTTACCTGTAAAATAAAAGAATTGGGCTAGAT 
GTCTCTGACAGTCCTCCCTGTATCTACAATCTGTGCCAAGATCTAAAGTCAAACACCCTG 
CAAGGCCCTGTGATACATATATAAACCACAAAGACAGAGCCCCGTCTTCCTTGAGTCCAC 
AGTTCACCCTGCATGTCCCCATCATGGTTCCCCAACATGTCCTCTGTCCCCAAAATCCAG 

GGCCCTGGCAATTAACTGAAAGGGCAAAGTCTTAATCAATCAAACAATGGAGGAATCACC 

GACTTTACACAGTATTTAATTGAATACAAACAAGCAACAGCAACAAATCCACCTCCACCC 

CATCTCCCCCTCATATCCCTGACCCAAAGCAAAGGTCAGAGCCTTTCGCCTCCTTCTATT 

CCATCTTTTGATTATTCCTTTGCCTCTCATTTCTTTGGAAGCAGGGTTTCTCCTCTCTGC 

CCAATTCCATATGTCCCTATTATCTCACTCAGCTGACAAGACGTGAAAATGAGTCACATT 
(C,TJ 

ATGTGGCTGGGGTGGGGTTCTTTTTTCATTGTAATCATTATTGTGGTTGCTTTCGTTTTG 
CCGTTAGGTTTTGCTTATTATTTTGTTTTGTCTTTTTTTTCTGAAGTGAGTGAAAAAGGT 
GCCACAAAGGAATTCCAGGTCCGAGCCAACAGAGAGAAACATGAATTTTTAGACACATGC 
TCTCCTGCCACCTCTTGGCTCCATCAAGATCCAGTTCCCCATCTCACTGTTTTCTCTGAG 
TTCTTGGGAGGAGTGATGGTGTTGGGGTAGAAATAAGCTCACTCACCCACGCAGGGTACT 



125010 TTTACACAGTATTTAATTGAATACAAACAAGCAACAGCAACAAATCCACCTCCACCCCAT 



CTCCCCCTCATATCCCTGACCCAAAGCAAAGGTCAGAGCCTTTCGCCTCCTTCTATTCCA 

TCTTTTGATTATTCCTTTGCCTCTCATTTCTTTGGAAGCAGGGTTTCTCCTCTCTGCCCA 

ATTCCATATGTCCCTATTATCTCACTCAGCTGACAAGACGTGAAAATGAGTCACATTCAT 

GTGGCTGGGGTGGGGTTCTTTTTTCATTGTAATCATTATTGTGGTTGCTTTCGTTTTGCC 
(A,G) 

TTAGGTTTTGCTTATTATTTTGTTTTGTCTTTTTTTTCTGAAGTGAGTGAAAAAGGTGCC 
ACAAAGGAATTCCAGGTCCGAGCCAACAGAGAGAAACATGAATTTTTAGACACATGCTCT 
CCTGCCACCTCTTGGCTCCATCAAGATCCAGTTCCCCATCTCACTGTTTTCTCTGAGTTC 
TTGGGAGGAGTGATGGTGTTGGGGTAGAAATAAGCTCACTCACCCACGCAGGGTACTAAA 
GATCTTACAGGAGCTTCAACTGGAGCAGGAGGAGCTTTTTATGCTTATGTTGAATCAAGT 

AAAGCATTTTTACAAGATAGGAACTGGAATTCCTCATTTCTCCCATGTTCCTGCTTGTTC 

TTAAACTTCATGAAGCTATTTTTCCAGCCTATGGGGTAGTTCTTGCTCCAGTAAGAGGAA 

TCTTAGTTGTCATAATCCCTTGGAGCCTGGGTTTTTGGAGAAAGAGATCTCCGTGCCCTA 

CAGACCTTTTCTCAACGAATGTGGGAAGGACCTGGCTTTAAAACACGCACACAAACACAC 

AAATAAACAGACATAAGATGTCATCACGAAACTGCCCACGGATCTTTAGGCTTTCTGCAT 
IT,C] 

GACATAAATACATTTTCTAAGGGGGGGGGGGAAGAAATTAAAAAACACCTGTTAATTTTA 
AAC AC ATTTTTT AAG AAAAAAAT AAT T AAAAAAG AAAC AGT GCTC ATGTC AT AAGCT ATG 
TTGACAGTTGCCAGTGGAAATGTTGGGTTGGTTCAAAAAAAAAATAAAAGCTATACTATA 
TCTCTCTACATACAGCTTGCTTCTACCTGTGTTTCTTCAGTGAAAGGTCCAGGGGGCCAC 
TGTGGGCTTCTTGTGAGGAGACGTGACTCAGGTGAAGGTGTCACCTCCTCTCACACTCAG 



126064 AACTGGAATTCCTCATTTCTCCCATGTTCCTGCTTGTTCTTAAACTTCATGAAGCTATTT 



TTCCAGCCTATGGGGTAGTTCTTGCTCCAGTAAGAGGAATCTTAGTTGTCATAATCCCTT 
GGAGCCTGGGTTTTTGGAGAAAGAGATCTCCGTGCCCTACAGACCTTTTCTCAACGAATG 
TGGGAAGGACCTGGCTTTAAAACACGCACACAAACACACAAATAAACAGACATAAGATGT 
CATCACGAAACTGCCCACGGATCTTTAGGCTTTCTGCATTGACATAAATACATTTTCTAA 



GGGGGGGGGGAAGAAATTAAAAAACACCTGTTAATTTTAAACACATTTTTTAAGAAAAAA 
ATAATTAAAAAAGAAACAGTGCTCATGTCATAAGCTATGTTGACAGTTGCCAGTGGAAAT 
GTTGGGTTGGTTCAAAAAAAAAATAAAAGCTATACTATATCTCTCTACATACAGCTTGCT 
TCTACCTGTGTTTCTTCAGTGAAAGGTCCAGGGGGCCACTGTGGGCTTCTTGTGAGGAGA 
CGTGACTCAGGTGAAGGTGTCACCTCCTCTCACACTCAGGTGCCAATGTGTCAGACCCAG 

AAATAAACAGACATAAGATGTCATCACGAAACTGCCCACGGATCTTTAGGCTTTCTGCAT 
TGACATAAATACATTTTCTAAGGGGGGGGGGGAAGAAATTAAAAAACACCTGTTAATTTT 
AAACACATTTTTTAAGAAAAAAATAATTAAAAAAGAAACAGTGCTCATGTCATAAGCTAT 
GTTGACAGTTGCCAGTGGAAATGTTGGGTTGGTTCAAAAAAAAAATAAAAGCTATACTAT 
AT C TC TCT AC AT AC AGCTTGC TTC T AC CT GT GTTTCTTC AGTG AAAGGT C C AGGGGGCC A 
|C,G] 

TGTGGGCTTCTTGTGAGGAGACGTGACTCAGGTGAAGGTGTCACCTCCTCTCACACTCAG 
GTGCCAATGTGTCAGACCCAGTATATTCTAAGCAAAAATACTTCAGGAAAATGCCACTTG 



[-,G] 
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TCAAAACCTGGACTTTGCGAAGTTGGAAGATGTAAGTAGTAGTAAAAGCTGTGGTAATTA 
TGGAGGAAGGAGGTTTCTGTATCAGAAAGGCATTGGCCGTGACAGACTC 
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SEQUENCE LISTING 



<110> PE CORPORATION (NY) 



<120> ISOLATED HUMAN TRANSPORTER PROTEINS, 

NUCLEIC ACID MOLECULES ENCODING HUMAN TRANSPORTER PROTEINS 
AND USES THEREOF 

<130> CL000891PCT 

<140> TO BE ASSIGNED 
<141> 2001-17-10 

<150> 60/240, 836 
<151> 2000-17-10 

<150> 09/804, 474 
<151> 2001-13-03 

<160> 4 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 2782 
<212> DNA 
<213> Human 



<400> 1 

gtctcgtgta 

ctggttacct 

gtgccaagca 

gtcatcctgc 

attgtctatt 

ttcatggcat 

aatggagaaa 

cttatggccc 

catgggttca 

atgttcatca 

aagcatctac 

tatatgattc 

ctcttcttct 

tacaaataca 

gagggtgacc 

gatgggaacc 

cggatcctca 

1020 

atggccaatt 
1080 

gccactcgta 
1140 

aagaaggcct 
1200 

gtcttctttg 
1260 

gtggtgagga 
1320 

ggttctgcca 
1380 



tggcgtggtt 
ttgtgctctt 
cagggcagaa 
caatctggta 
ttgtggccct 
ctattgaagt 
ccagcacaac 
tgggttcctc 
ttgctggtga 
tcattggcat 
gagtcttctt 
tggcagtctt 
ttccagtgtg 
tgcacaaaaa 
accctaaggg 
tggtgcccct 
aggatctgaa 



aaggttgcag 
cctgaatggt 
caatgagtcc 
cccggagaac 
gatatacatg 
catcacctct 
cactattcgg 
tgctcctgag 
tctgggacct 
ctgtgtctac 
catcaccgct 
ctcccctggt 
tgtccttctg 
gtaccgcaca 
cattgagatg 
ggaagggaag 
gcaaaaacac 



actatgctct ttcccaccaa 
tgatgactgg tgcaggcaat 
ccagcatgag cgaggtgcac 
acccatgttc ttaccagtgc 
aagggggaga catgtcaaag 
atgcaggggc tgactatgag 



cctctcacct 
cttcgagcag 
tgttcagggt 
ccttcccttg 
ttccttgggg 
caagagaggg 
gtctggaatg 
atactcctct 
tctaccattg 
gtgatcccag 
gcttggagta 
gtggtccagg 
gcctgggtgg 
gacaaacacc 
gatgggaaaa 
gaagtggatg 
ccagagaagg 

cagaagagcc 

atcctgaaga 

accgatgagc 

ctggagaact 

accatgtatg 

ttcacagagg 



ctgccttcct 
aggctggtgg 
catcggactg 
gggacaagat 
tgtccatcat 
aggtgacaat 
aaactgtctc 
ctttaattga 
tagggagtgc 
acggagagac 
tctttgccta 
tttgggaagg 
cagataaacg 
gaggaattat 
tgatgaattc 
agtcccgcag 
acttagatca 

gcgccttcta 

aacatgcagc 

ctgaggactt 

gtggggctgt 

tggactacaa 

gcacggtggt 



ccattttggg 60 
ctcaggggac 120 
caaggagggt 180 
tgccagggtc 240 
tgctgaccgc 300 
taagaaaccc 360 
caacctgacc 420 
ggtgtgtggt 480 
agccttcaac 540 
tcgcaagatc 600 
catctggctc 660 
cctcctcact 720 
actgctcttc 780 
catagagaca 840 
ccattttcta 900 
agagatgatc 960 
gctggtggag 

ccgtatccaa 

agaacaagcc 

tatttccaag 

actcctgaca 

aacagaggat 

tctgaagcca 
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ggagagaccc agaaggagtt ctccgtgggc 
1440 

gaacacttct ttgtaaggtt gagcaatgtc 
1500 

atgcctccag caatattcaa cagtcttccc 
1560 

gtggccacag ttaccatctt ggatgatgac 
1620 

actattcatg tcagtgagag tattggtgtt 
1680 

gcccggggta cagtcatcgt cccctttagg 
1740 

gaggactttg aagacacata tggggagttg 
1800 

ataagggtta aaatagtaga tgaggaggaa 
1860 

cttggtgaac cgaaatgiat qgaacgtgga 
1920 

atggaagaag aggaq*;ccna gsggatagca 
1980 

cccaaactgg aacjt ca t -r»r *. qjn^gtcc 
2040 

atcaagaaga caa.irct I )-; -:ttg;ttgtg 
2100 

gaggccatca ccgt. )z ;%q r^cj^gqat 

2160 

ctgccctcct gctt t rjd -rr ,\ cqt:atq:ac 
2220 

gcctgtgtgc ccccca c<i a yt.ictgrcac 
2280 

atcattggca tgctcaccqc catcattggg 
2340 

ggtctcaaag attcggtc.^c aqctgttgtt 
2400 

acgtttgcca gcaaagctgc tgccctccag 
2460 

gtgacgggca gcaacgccgt caatgtcttc 
2520 

gccatctact gggctctgca gggaeaggag 
2580 

tccgtcaccc tcttcaccat ctttgcattt 
2640 

cggccgcacc tgggagggga gcttggtggc 
2700 

ctctttgtga gcctgtggct cctctacata 
2760 

atcaaggggt tctaagccac ac 
2782 

<210> 2 
<211> 921 
<212> PRT 
<213> Human 

<400> 2 

Met Ala Trp Leu Arg Leu Gin Pro 

1 5 
Gly Leu Val Thr Phe Val Leu Phe 
20 

Gly Gly Ser Gly Asp Val Pro Ser 
35 4 0 



ataattgatg acgacatttt tgaggaggat 
cgcatagagg aggagcagcc agaggagggg 
ttgcctcggg ctgtcctagc ctccccttgt 
catgcaggca tcttcacttt tgaatgtgat- 
atggaggtca aggttctgcg gacatcaggt 
acagtagaag ggacagccaa gggtggcggt 
gaattcaaga atgatgaaac tgtgaaaacc 
tacgaaaggc aagagaattt cttcattgcc 
atatcagatg tgacagacag gaagctgact 
gagatgggaa agccagtatt gggtgaacac 
tatgagttca agactacggt ggacaaactg 
gggacccatt cctggaggga ccagttcatg 
gaggatgagg atgaatccgg ggaggagagg 
ttcctgactg tcttctggaa ggtgctgttt 
ggctgggcct gcttcgccgt ctccatcctc 
gacctggcct cgcacttcgg ctgcaccatt 
ttcgtggcat ttggcacctc tgtcccagat 
gatgtatatg cagacgcctc cattggcaac 
ctgggcatcg gcctggcctg gtccgtggcc 
ttccacgtgt cggccggcac actggccttc 
gtctgcatca gcgtgctctt gtaccgaagg 
ccccgtggct gcaagctcgc cacaacatgg 
ctctttgcca cactagaggc ctattgctac 



Leu Thr Ser Ala Phe Leu His Phe 

10 . 15 

Leu Asn Gly Leu Arg Ala Glu Ala 
25 30 
Thr Gly Gin Asn Asn Glu Ser Cys 
45 
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Ser Gly Ser Ser Asp Cys Lys Glu Gly Val He Leu Pro He Trp Tvr 

50 55 60 

Pro Glu Asn Pro Ser Leu Gly Asp Lys He Ala Arg Val He Val Tvr 
65 70- 75 80 

Phe Val Ala Leu He Tyr Met Phe Leu Gly Val Ser He He Ala Asp 

85 90 95 

Arg Phe Met Ala Ser He Glu Val He Thr Ser Gin Glu Arg Glu Val 

100 105 110 

Thr He Lys Lys Pro Asn Gly Glu Thr Ser Thr Thr Thr He Arg Val 

115 120 125 

Trp Asn Glu Thr Val Ser Asn Leu Thr Leu Met Ala Leu Gly Ser Ser 

I 30 135 140 

Ala Pro Glu He Leu Leu Ser Leu He Glu Val Cys Gly His Glv Phe 
145 150 155 160 

He Ala Gly Asp Leu Gly Pro Ser Thr He Val Gly Ser Ala Ala Phe 

165 170 175 

Asn Met Phe He He He Gly He Cys Val Tyr Val He Pro Asp Gly 

180 185 190 

Glu Thr Arg Lys He Lys His Leu Arg Val Phe Phe He Thr Ala Ala 

195 200 205 

Trp Ser He Phe Ala Tyr He Trp Leu Tyr Met He Leu Ala Val Phe 

210 215 220 

Ser Pro Gly Val Val Gin Val Trp Glu Gly Leu Leu Thr Leu Phe Phe 
225 230 235 240 

Phe Pro Val Cys Val Leu Leu Ala Trp Val Ala Asp Lys Arg Leu Leu 

245 250 255 

Phe Tyr Lys Tyr Met His Lys Lys Tyr Arg Thr Asp Lys His Arg Glv 

260 265 270 

He He He Glu Thr Glu Gly Asp His Pro Lys Gly He Glu Met Asp 

275 280 285 

Gly Lys Met Met Asn Ser His Phe Leu Asp Gly Asn Leu Val Pro Leu 

290 295 300 

Glu Gly Lys Glu Val Asp Glu Ser Arg Arg Glu Met He Arg He Leu 
305 310 315 320 

Lys Asp Leu Lys Gin Lys His Pro Glu Lys Asp Leu Asp Gin Leu Val 

325 330 335 

Glu Met Ala Asn Tyr Tyr Ala Leu Ser His Gin Gin Lys Ser Arg Ala 

340 345 350 

Phe Tyr Arg He Gin Ala Thr Arg Met Met Thr Gly Ala Gly Asn He 

355 360 365 

Leu Lys Lys His Ala Ala Glu Gin Ala Lys Lys Ala Ser Ser Met Ser 
370 375 380 

Glu Val His Thr Asp Glu Pro Glu Asp Phe He Ser Lys Val Phe Phe 
385 390 395 400 

Asp Pro Cys Ser Tyr Gin Cys Leu Glu Asn Cys Gly Ala Val Leu Leu 

405 410 415 

Thr Val Val Arg Lys Gly Gly Asp Met Ser Lys Thr Met Tyr Val Asp 

420 ' 425 430 

Tyr Lys Thr Glu Asp Gly Ser Ala Asn Ala Gly Ala Asp Tyr Glu Phe 

435 440 445 

Thr Glu Gly Thr Val Val Leu Lys Pro Gly Glu Thr Gin Lys Glu Phe 

450 455 460 

Ser Val Gly He He Asp Asp Asp He Phe Glu Glu Asp Glu His Phe 
465 470 475 480 

Phe Val Arg Leu Ser Asn Val Arg He Glu Glu Glu Gin Pro Glu Glu 

485 490 495 

Gly Met Pro Pro Ala He Phe Asn Ser Leu Pro Leu Pro Arg Ala Val 

500 505 510 

Leu Ala Ser Pro Cys Val Ala Thr Val Thr He Leu Asp Asp Asp His 

515 520 525 

Ala Gly He Phe Thr Phe Glu Cys Asp Thr He His Val Ser Glu Ser 
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530 










535 






He 




vai 


weti 


olU 


Val 


Lys 


yai 


ijeu 


54 5 










550 








Thr 


Val 


He 


val 


Pro 


Phe Arg 


Thr 


Val 










c c: 
ODD 










Gly 


CalU 


ASp 


Fne 


V3lU 


Asp 


inr 


Tyr 


Gly 








coo 

580 










c o c 

585 


Glu 


Thr 


Val 


Lys 


rp 

Tnr 


lie 


Arg 


Val 


Lys 






595 










600 




Glu 


Arg 


Gin 


Glu 


Asn 


Pne 


Phe 


He 


Ala 




610 










615 






Glu 


Arg 


Gly 


lie 


Ser 


Asp 


Val 


Thr 


Asp 


625 










630 








Glu 


Glu 


Ala 


Lys 


Arg 


He 


Ala 


Glu 


Met 










645 










His 


Pro 


Lys 


Leu 


Glu 


Val 


He 


He 


Glu 








660 










665 


Thr 


Val 


Asp 


Lys 


Leu 


He 


Lys 


Lys 


Thr 






675 










680 




Thr 


His 


Ser 


Trp 


Arg 


Asp 


Gin 


Phe 


Met 




690 










695 






Ala 


Gly 


Asp 


Glu 


Asp 


Glu Asp 


Glu 


Ser 


705 










710 








Cys 


Phe 


Asp 


Tyr 


Val 


Met 


His 


Phe 


Leu 










725 










Phe 


Ala 


Cys 


Val 


Pro 


Pro 


Thr 


Glu 


Tyr 








74 0 










745 


Ala 


Val 


Ser 


lie 


Leu 


He 


He 


Gly 


Met 






755 










760 




Leu 


Ala 


Ser 


His 


Phe 


Gly 


Cys 


Thr 


lie 




770 










775 






Ala 


Val 


Val 


Phe 


Val 


Ala 


Phe 


Gly 


Thr 


785 










790 








Ser 


Lys 


Ala 


Ala 


Ala 


Leu 


Gin 


Asp 


Val 










805 










Asn 


Val 


Thr 


Gly 


Ser 


Asn 


Ala 


Val 


Asn 








820 










825 


Ala 


Trp 


Ser 


Val 


Ala 


Ala 


He 


Tyr 


Trp 






835 










840 




His 


Val 


Ser 


Ala 


Gly 


Thr 


Leu 


Ala 


Phe 




850 










855 






Phe 


Ala 


Phe 


Val 


Cys 


He 


Ser 


Val 


Leu 


865 










870 








Leu 


Gly 


Gly 


Glu 


Leu 


Gly Gly 


Pro 


Arg 










885 










Trp 


Leu 


Phe 


Val 


Ser 


Leu 


Trp 


Leu 


Leu 








900 










905 


Glu 


Ala 


Tyr 


Cys 


Tyr 


He 


Lys 


Gly 


Phe 






915 










920 





<210> 3 
<211> 126512 
<212> DNA 
<213> Human 

<220> 

<221> misc_feature 
<222> (1) . . . (126512) 
<223> n = A, T,C or G 
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540 



Arg 


Thr 


Ser 


Gly Ala 


Arg 


Gly 




555 










560 


Glu 


Gly 


Thr 


Ala 


Lys 


Gly 


Gly 


•570 










575 




Glu 


Leu 


Glu 


Phe 


Lys 
590 


Asn 


Asp 


lie 


Val 


Asp 


Glu 
605 


Glu 


Glu 


Tyr 


Leu 


Gly 


Glu 
620 


Pro 


Lys 


Trp 


Met 


Arg 


Lys 


Leu 


Thr Met *Glu 


Glu 




635 










640 


Gly 


Lys 


Pro 


Val 


Leu 


Gly 


Glu 


650 










655 




Glu 


Ser 


Tyr 


Glu 


Phe 
670 


Lys 


Thr 


Asn 


Leu 


Ala 


Leu 
685 


Val 


Val 


Gly 


Glu 


Ala 


lie 
700 


Thr 


Val 


Ser 


Ala 


Gly Glu 


Glu Arg 


Leu 


Pro 


Ser 




715 










720 


Thr 


Val 


Phe 


Trp 


Lys 


Val 


Leu 


730 










735 




Cys 


His 


Gly Trp Ala 


Cys 


Phe 










750 






Leu 


Thr 


Ala 


He 
7 65 


lie 


Gly 


Asp 


Gly Leu 


Lys 


Asp 


Ser 


Val 


Thr 






780 










Ser 


Val 


Pro Asp 


Thr 


Phe 


Ala 




795 










800 


Tyr 


Ala 


Asp 


Ala 


Ser 


lie 


Gly 


810 










815 




Val 


Phe 


Leu 


Gly 


He 
830 


Gly 


Leu 


Ala 


Leu 


Gin Gly Gin 


Glu 


Phe 








845 








Ser 


Val 


Thr 
860 


Leu 


Phe 


Thr 


He 


Leu 


Tyr 
875 


Arg 


Arg 


Arg 


Pro 


His 
880 


Gly 


Cys 


Lys 


Leu 


Ala 


Thr 


Thr 


890 










895 




Tyr 


He 


Leu 


Phe 


Ala 
910 


Thr 


Leu 
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<400> 3 

ttggatgaga tctaaagcat tattaagagt 
aagatgaatg agaataatgg catgtacaaa 
tccaagagtg aacctgaagg ctattgttgt 
tgaaggggga tgagtgggaa gcagtgaatt 
atttggtctt tatatcaaga gcaagagaag 
tatgcttaag tctgtttgtt tgttttttta 
gtggaattga gaactagcga gagttggagt 
gattcatgag gtaaggagtg atggtggcct 
aagagttgat aggatttagt gattggataa 
taacatctgc tttctaacat gagtaactgg 
aaacaggaga aaaaggagca aatatggggg 
caaacacaaa tcagcattcc tttattacta 
gaaagatcgc ccccccccca cttttttttt 
tcatcccacc caaatgaggc agtgagtggt 
ttcatcccat tcttctgagc tcaagctcac 
ctgcatatca ccccca c -iaat cttgtctcta 
catgttttat taataatgat ttttaccact 
1020 

gagtggctgg cggat^atca t rjt t cctgtt 
1080 

tttactggag ggctc<ir;<iac tact t7t gaac 
1140 

gtttggctat ggggtg?^; gca-r caeca 
1200 

ctcagaatta tagcttacrt cca^aagtct 
12 60 

gagctcagtt ttcaagqt t ? a' tgctctgt 
1320 

agtggggagt gcaaagaag.i aacactcatt 
1380 

aaaggtcctg gggtggacaa tcacttggta 
14 40 

aatatgtata cattaaacac agcaaggttt 
1500 

tcaattcaat atactctctt gcaaacctat 
1560 

cagggtaagg aagtccgaac accccagaaa 
1620 

tacatggect attttaagtt gcttcttgag 
1680 

caggacttga aggctattgc tgcatttcca 
1740 

ttttcaaegg agtctccacc agatggtttg 
1800 

tgacatctat ttttccaagt gagagactgc 
1860 

gtgaagcatc agttgtattg gtgggaacct 
1920 

ttttcctgcc tctctgatct tttctaggtc 
1980 

caatagaagc cagtggctaa gtctcgtgta 
2040 

ctgccttcct ccattttggg ctggttacct 
2100 

aggctggtgg ctcaggggac gtgecaagea 
2160 

categgactg caaggagggt gtcatcctgc 
2220 

gggacaagat tgccagggtc attgtctatt 
2280 



ggggagtgca aagaagaaac cctcatttca 60 
ggtcctgggg tggacagtca cttggtataa 120 
tgaaatgtaa taagggagag agtgacggga 180 
cctgcaaggc tttgaaggtc atgggaaaga 240 
actactaaag ggcttcaaac aggggagega 300 
aaaaaagatt acggtggcta tatgaggaaa 360 
ggtgagctcc attaggaggc tactgaagta 4 20 
gggctgggat gatggtggta gaaatggaga 4 80 
gggacagaag agagatgaag gctttcagac 540 
gtggctgaag atgctatttt ctgagctggg 600 
atgaagactt tgagtcttta aggtgctgta 660 
agggtatccc acacagttgt agcagaggga 720 
ttttttagct attccatggt attttcattc 780 
aagatgagta tataatagtt teaattgeat 840 
cttttagtgg tttgaggeca gtagatgaag 900 
gtttaacaaa acttatttga gagacatttg 960 
tgttcctttc catgtttggg tttgaaattt 

actgcctgct taaactgetc ataagcaggt 

ttggtcttgg gcacaactta catggcctct 

tttatcaact cttttgattt cccaagctat 

tgcattcggg gaggaagttt ctttccaagg 

ttaatggatg agatctaaag cattattaag 

tcaaaatcga ttgagaataa tggcatgtac 

taatcctgga gtgaacatga aggecaagga 

tcaattttct ggggactagt ccatgaaaat 

gttatccaag atactcaagt ataatgacaa 

cagtataaat gggcatgaag attcaggtta 

aactctcaca ggtaatacca gtttgggaga 

tccccagtat tcccagctat ttcaagecat 

gaggacagag cagctatttg tgcctcccat 

cccatatgtt agtgcaatat gtcactggag 

geegtttget gtcccctttt tcctcatgcc 

tetggectat caggaggaca actggtgctg 

tggcgtggtt aaggttgcag cctctcacct 

ttgtgctctt cctgaatggt cttcgagcag 

cagggcagaa caatgagtcc tgttcagggt 

caatctggta cccggagaac ccttcccttg 

ttgtggccct gatatacatg ttccttgggg 
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tgtccatcat tgctgaccgc ttcatggcat ctattgaagt catcacctct caagagaggg 
2340 

aggtgacaat taagaaaccc aatggagaaa ccagcacaac aactattcgg gtctggaatg 
2400 

aaactgtctc caacctgacc cttatggccc tgggttcctc tgctcctgag atactcctct 
2460 

ctttaattga ggtgtgtggt catgggttca ttgctggtga tctgggacct tctaccattg 
2520 

tagggagtgc agccttcaac atgttcatca tcattggcat ctgtgtctac gtgatcccag 
2580 

acggagagac tcgcaagatc aagcatctac gagtcttctt catcaccgct gcttggagta 
2640 

tctttgccta catctggctc tatatgattc tggcagtctt ctcccctggt gtggtccagg 
2700 

tttgggaagg cctcctcact ctcttcttct ttccagtgtg tgtccttctg gcctgggtgg 
27 60 

cagataaacg actgctcttc tacaaataca tgcacaaaaa gtaccgcaca gacaaacacc 
2820 

gaggaattat catagagaca gagggtgacc accctaaggg cattgagatg gatgggaaaa 
2880 

tgatgaattc ccattttcta gatgggaacc tggtgcccct ggaagggaag gaagtggatg 
2940 

agtcccgcag agagatgatc cggattctca aggatctgaa gcaaaaacac ccagagaagg 
3000 

acttagatca gctggtggag atggccaatt actatgctct ttcccaccaa cagaagagcc 
3060 

gcgccttcta ccgtatccaa gccactcgta tgatgactgg tgcaggcaat atcctgaaga 
3120 

aacatgcagc agaacaagcc aagaaggcct ccagcatgag cgaggtgcac accgatgagc 
3180 

ctgaggactt tatttccaag gtcttctttg acccatgttc ttaccagtgc ctggagaact 
3240 

gtggggctgt actcctgaca gtggtgagga aagggggaga catgtcaaag accatgtatg 
3300 

tggactacaa aacagaggat ggttctgcca atgcaggggc tgactatgag ttcacagagg 
3360 

gcacggtggt tctgaagcca ggagagaccc agaaggagtt ctccgtgggc ataattgatg 
3420 

acgacatttt tgaggaggat gaacacttct ttgtaaggtt gagcaatgtc cgcatagagg 
34 80 

aggagcagcc agaggagggg atgcctccag caatattcaa cagtcttccc ttgcctcggg 
354 0 

ctgtcctagc ctccccttgt gtggccacag ttaccatctt ggatgatgac catgcaggca 
3600 

tcttcacttt tgaatgtgat actattcatg tcagtgagag tattggtgtt atggaggtca 
3660 

aggttctgcg gacatcaggt gcccggggta cagtcatcgt cccctttagg acagtagaag 
3720 

ggacagccaa gggtggcggt gaggactttg aagacacata tggggagttg gaattcaaga 
3780 

atgatgaaac tgtgtaagta accttcctgt attctgcccc tccctgaccc catcttttgc 
3840 

catctctttc tgtctttctg tactgcactt tacaacattt ccttgtgttt gtgttaatgt 
3900 

caaactttgg ttccatcaca ggtatgcagg atcagcagac accactggac aggttctgct 
3960 

tccaaactct tcttcagttt tctcacttta aattgtttct gggcaaggaa tcctgtgaca 
4020 

agagctaagg acacaaaaca ttttcttctc tgaaacacaa aatgatagct ggtggagctg 
4080 
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tgggatgaca gaagttttgt gatatcagat 
4140 

agagaactgc ttgggcctct tcttcctccc 
4200 

gaaaccaatc ctttccattc tgggcagcaa 
4260 

gtggcattgg ctttgttccc tggtactata 
4320 

ggcacgtctt ctttaatttg gaaattgcaa 
4380 

tgaccataga ctatttaatg gtgtgaggct 
4 4 4 0 

gtttggatac tggaaagtat cttggagaag 
4 500 

ttagtcacag gtctaagctc caaaatctgg 
4560 

ttgtgggagg aaatatatct tctgtgtggt 
4 620 

aagtacctct tcttgggcta ctgccceatt 
4 680 

tctgtcacct ctattaccct aaggcctctg 
4740 

ctgtcctttt tccagttgga tagctttaaa 
4800 

tgtagcttat gagaaagtag actttcttgg 
4 860 

gaacttcctc tgaatgactg aacatatcca 
4920 

aagagtaaaa ggagcatata ggaagttgtg 
4980 

aattatacct tcagtaagaa atgaagccat 
5040 

tcatggaaaa gtggggagcc atagaagcta 
5100 

aggtccagcc atgagacaga gctcaaaaag 
5160 

gacccggggg atcctgcatc tgttcttact 
5220 

aggtcattta ttggttctgg aacttaactt 
5280 

ttccttgcct ggaggcaagg tcagtccttt 
5340 

gaaccataga cataagcact cagtgtaggt 
54 00 

gaatattgtc tcctgaaggg aaatggccat 
5460 

gtcattttgc tgggaaggat aatcatttgt 
5520 

gaatactatc catttatatg tactctgggg 
5580 

ggcttcactg ggaaaacaaa ctccatggaa 
5640 

aagaagatat ggtaaggcat taaatgacat 
5700 

gacttgtggt tcttctgatt cagaaccctt 
57 60 

agagatcaga ggcttagttc attctattaa 
5820 

acctagggtc tctgaatacg actaaacctt 
5880 



tttggagaat 


tcttgtgact 


aagaaggact 


ttcctcatat 


gaagggtatc 


tatgagcttt 


tagcccatca 


gaacattcta 


aagaaaacaa 


ttgccagtct 


cactgtgtaa 


ccagattcca 


aattgataga 


aatttagcaa 


tctttttaaa 


tgcccagcct 


agttgaattg 


agtcagtatg 


cagagctccc 


agggcagtgg 


ctacttgtct 


tgaagcagtg 


aaggagaaac 


atcctaggaa 


cctctctttt 


cacagtctag 


gactctcctg 


cagcccttca 


gaaactgtgg 


gtattacact 


cccattgaac 


cctcttgcaa 


attggttatt 


agggaaagca 


gaatgacttt 


cctcaggatt 


gtggcctaga 


aggttggaga 


agacaaacgg 


caaataataa 


gcgtggcagg 


agatggtgtg 


tgtgtggggt 


gtctgtttca 


agaacctgct 


acaacctcta 


gaagaggagg 


aggaaggaac 


gggagaggtg 


tcctaggagt 


gcttctgccc 


agctgggcac 


tgctggtgac 


agaactgagt 


caatcccttc 


ttaataatgt 


gacttggggc 


tctgatatgc 


aaactgggaa 


taacaatact 


ttgcagttcc 


ttccagctct 


aagattttct 


catattcgca 


cttgccaaaa 


atggatcagg 


tgacaaattg 


atttattaga 


gctctgttta 


taacgtaagt 


agaaacctgt 


gccttctgga 


agagtgttta 


tacatacaaa 


tgaaggacag 


tttcacatga 


ttatcgcgat 


gtcagtgtgg 


taagaccaca 


aaatttgcca 


taatttgacg 


tctacccatg 


tcacggatag 


gtagtttttc 


tttcctcatt 


ctattaataa 


tcaattatgc 


cctcaaactt 


atttgcattt 


tcagtttgta 
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taatatcttg gtgcaaatga gcctcgcaaa tgatcacttc tgggtaatac tcattctaaa 
5940 

ggtatgtcaa ccttgagaat tctggtctag atattctagg gtttggtgaa caaatctatg 
6000 

ttcccatcca tcccttttca tttatttttt agacttcatt cattgcagaa taatgagtcc 
6060 

aaaacctgct catctgttct cacgtggcac ccctattctt gatattttaa attgcaattt. 
6120 

tacaactaga ggcagtatta cggagcagaa aaatcgtggg ttctaagtac tctgggttag 
6180 

gattctggct ccactactga tttaataatg tagtttgggg aaattttatt aacctatgaa 
6240 

attatttcct cattggcaaa atggggataa taatatctct cttgcagggc cattatgacg 
6300 

attcaaggta ttgtatgcgg tgtacctggt acacggtata tgctcaggaa acaagactct 
6360 

tcatagtaat attgacgaat taacaatatt cttcagaaga cactgtggag ttgtttaggt 
6420 

tacttggctc tttgtgtgac cctaagtaat gagcatgcca gtttggggtt actatgaaga 
6480 

gtacttacct aaactcataa aatattagag ctagaaagga ccttagaata tcttctgcag 
6540 

tcatggttct taaattttaa tgtgttgctc aatcatccag ggatctcact gaagggcaga 
6600 

ttaggatcca ggaggtctag gggagggatt gagattccgc atttctaaca agttctggat 
6660 

gctgcgggcc ccaacttaga ggtgaaaggt tctgaagctc ttgaccaaac caggagaccc 
6720 

agcaaagaag tggtttttca gacaacttgc ttaattgaat aatgattgtt tgctctttaa 
6780 

ttccaacttt caatgccaat ttagcaagaa ccagaggctg tgctaattgc cacaccagtc 
6840 

tggaaaccga aatggatagc ttcagggtac ttggacaaag ttggaacatc tgctttctaa. 
6900 

tctctccctc tttgtatagc tttatttgcc taccaagcct ggtagtattg aaaatctgcc 
6960 

ctcactatac tcccctaaat ataatcaagt tgaggccagg cctgtgctct atcaataata 
7020 

taggatccac gaattcacat gtttggtttt atgctttact tcttcaaagg tgcttttagc 
7080 

agcatggaag aatggaaaag cacgagcttt ggaatatgaa agcagatgtg aatccatcac 
7140 

ttaccagtaa cttttaacaa gtcacatcac ttttctgagt accaggtttt tgttggacaa 
7200 

cagaaataat attctctatc cttcaaggga atactaaata taagtatgag aaaaatgcac 
7260 

agtgccttct cgtagatggt gttcagtcat tcaacaaaca tttgttagat atttgctatg 
7 320 

tactagctac attactaggc actggggtta aataagtgaa taagacaagc tgacatttca 
7380 * 

gcgctcaagg atcttactgt caagtggaga ggatcaaagg gtacagacaa atcaaggaac 
7440 

gtgagagaag tggtatggct gagatggatt gaataaagga gcaatgagag ctccctgcaa 
7500 

tgtgtgtggt accactgagg attctaaatt aaccttcatt aaggacttag tagtgacaga 
7560 

ggtgaagtgg ggataggtac atgattaatt tacatccata ttacaatgaa accttaacat 
7620 

ttaagaggga tattattgat gtcttcatga tccagaagaa tcctcacctt tgcaaccatc 
7 680 
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actatagtca cttcttgaga attatggcct ttaagactgt agcatgcaat gacaaaacct 
cacagaggta tgggttctgc ccgcacacta atttcactca ttaaacaagt gactggctcc 
7860 tCCCa9 gCtGtcagca c 9 c ctttgca aaataacaga ttattgcagc tcttggacct 
ll% t9CCtC tgggaatagt caaagccaca gatgtcaaat atgtaaatgc caagatctat 

tataattaaa tagtgcaggc ctccttcaaa gaaaaaaagc atgttggctg tgctgcacgt 
'96 0 

tctccaacca aatcagaatg ttaaagctcg aaggtatctg acctcccatt ttttaaatta 
o 0 s 0 

tqaagatgaa attcagaaag ggaaggtaac ttatccaaga ttacatggct agctatgata 

gaaagttaga gttggaaagg acgttagaaa gtgagggttt gaaaggactt tagaagctgc 

ttattcaatg ttctctctgc cctttcccat cttaggcttc tccattttac ttttatccat 
o 2 U 

caataaaatg ttaacttcaa aaagaatatg gcaattcttg ggtaaaagat gctctggaag 
tgtgagtccg ggagtattat gtgactaatg tcttaactaa gaataataat atattatgga 
ctagttttaa tctcttgttt caccttgaac tgttcaggaa ggaaaatagc ccacggaaat 
tttttaaaaa gtctttctct atctgaattg agaaaaggtg acaggcatag ttggaacatc 
ttttaggcag tgctggtgaa cttcaggcta ggccttgttc catgaaataa taaaaatttt 
caaaataatg cagaccattc ccttccaggg atgctttctc tgtaatgttt taaccccaag 
3640° tttCt gtaaaaatct ataaaaatct ggagtgttcc aggatacaat ttgcacattc 
8700 atttaa Ctaaaacaca atc gattttt tgttttcttt ttctttggct tagcaaggtt 
ttaagatagt ctctttctgg ccacagaggg agatgatttg cctctagaat accctttctg 
SS^"^ 939 agtCacaaga ctgcaagctc atggaggatg agagtcaagt agaggtggtg 
Qoo^ CtCtCC cttg 9 ccaac atccctctct ttctctttcc ttctgccttc agtggcagta 
gcaaaagtcc tccttctctt taggtagaca gtcagccact acaactgtgg cttcctgaaa 
9000 Cagtg9 agCtatgtac ttggcacaga tttgtcttga agaagggact ccatttctga 
gccagttgtt gaatggggat acttagcagt acagtgaggc atttccagta ggattgttca 
accacaattg cccactttcc aggcccaaag gaataattga aggctatgta gacttttttt 
9180 tttttfc ttttt:ttttt t1:t g a g a tgg agtctcgctc tgtcgcccag gctggagtgc 

agtggcacat ctcggctcac tgcaagctct gcctcccggg ttcacgccat tctcctgcct 

92 4 0 

cagcctcccg agtagctagg cctaatatat atatattata catatatatt tatatttata 

tatatatata ccaccacgtc cggctaatat atatttatac tttttttttt tagtaggaaa 

93 60 

ggggtttcac catgttagcc agtatggtct cgatctcctg acctcgtgat ccaccagcct 

cagcctccca aagtgctggg attacaggcg tgagccaccg tgcccgacca tgctatgtaa 
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actttttagc agaagcttta gctattgtgt cccgaagggc cccaggtcat gatgaaatgt 
9540 

cttttttttt ttttgtctct tttcttctta attaotgaga ctgtcaaaga atatgtcaaa 
9600 

gcatgacata ttccaactcc aggatccata aaacacccca agttctgtgg agaccctatc 
9660 

acatctgcaa aactctccag gaagtccaga gccctcctgg ttaatttgtt ttagggacta. 
9720 

ggcatgcggt atcccctgac aacactggat cagcaattct cctacctaag tcagtcccac 
9780 

accatgtgca gcagagtatc cagtgcccct gccctggtct gctcacattg gtttgctctc 
9840 

cagaataata attcctcaat atccacaaga gattgattcc agaactactc cgaggatacc 
9900 

aaaaatcctc agatgctcaa gtacctggta taaaatggca cagtatttgg catatgacct 
9960 

aggcatattc tctcccatat actttattta tttatttatt tcgggacaga atctcattct 
10020 

gtcgcccagg ctgtcactcg cttattgcaa cctctgcctc ccaggttcaa gcaattctcc 
10080 

tgcctcagcc tcctaagtag ctgggactac agacgcatgt caccacgcct ggctactttt 
10140 

tgtattttta gtagagacag agtttcacca tgttggccag gctggtctca aacacctgac 
10200 

ctcaagtgat ccgcccacct tggcctccca aaaagctggg attacaggcg tgagctacca 
10260 

cgtccagccc cccatatact ttaaatcatc tctagattac ttataatacc taatacaatg 
10320 

taaatgttat atagttgttt taatgtattg ctttttttat ttgtattgtt ttttattgct 
10380 

gtattatcct tttttatgtt ttattttttc aaatattttc tacccgtggc acccacagtt 
10440 

ggttggtgga acctgcggtt ggtggagccc atggatgtga agggctgata gtatgagaaa 
10500 

actcagaggt gcagagttgg agagcacatc ggggagaatg tcagcatggg ttaaaaaaga 
10560 

cacactgtgg ttggagatga tcacatgaat ggccacttca aaaatgaatg ggtctcatcc 
10620 

tcaaagcagg ctctcctggg cactgcttgg gaaggtgcta attggagctt caggcaacaa 
10680 

taataagggg atacaggtgg ggatcctgcc atgggcgtag cttactttct ctggactctt 
10740 

ctgggtctta aggccagttt cctcatccac tcaaaagaat gacagcaagg tgagcaaagc 
10800 

aaggcaggta aatgaggagg actctttctg gctgtccaac ttttcatcaa cttcccaaag 
10860 

gtttttggat gggacatgag cactcattcc ttctccaccc tttagctagg ccctgtcaac 
10920 

tccaggagga aggtagaaga ggtcagagct gtggtctttc acttattcaa gatgtttcct 
10980 

tagtgttttg tgtttgggtt ttttttgttt tttttttttt gacagagtct tgctctgttg 
11040 

cccaggctgg agtgaagtga agtggcataa tctgagctca ctgcaacctc tgctttcgag 
11100 

ttcaagcgat tctcatgcct cagcctcctg catagctggg actacaggca tatgctacca 
11160 

tgcctggcta atttttgtat ttttagtaga gacggggttt tgccatgttg gccaggctga 
11220 

tctcaaactc ctgacttcag gtgatccagc caccttggcc tcccaaagtg ctgggattac 
11280 
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11340 tgaa ° Ca ° tgcacct gg ccc cttat tgttggtttt taaaagagaa actaagctgt 
11400° a9aa CCCagtttga Saaagtttga agacctggca tagagccagt gacatataat 
11460 9tt9a agaaagaga 9 ctccttgatc tgcaaataga gcacggcccc atatttaaat 
tctgcacatt ctagaagcat tttgcaagaa tcaaatgctt tgaggatttt gctaaataac 
catggaggaa agcactagac aaatattttc agatggcatg agagttatca ttcataggaa 
11640 tttCC a ° tCCtacca cttactgggg acccaagtaa gaaattactt ggataagcag 
aggagaattt aaagttgaat gtggtggaac ttattatgga aaaaatatgt ttttctgaaa 
actggatatg tgtatatata taagttcagt tgtcattttg gaaccatcct tactcttcct 
I?820 a " at tagC3taCat aggtgcaact tgactaactc tgcctggacc caattcagtt 
11880 tt99t g9gtagggtt catgaagaag cagttatttg tggagtgtat agaaaccact 
tttll 9tagg ttcttta 9" ggtactttca aaataagtga catccaaata gtaacttaat 
12000 aaata tggctgcaaa acaaattgtc gattatggat gactactact gccatctctc 
12060 Ca9tC CatCttCtgc caggctgttt ggtcttgatt tgtcgacctt ttaggtttct 
12120 t9tat tCCaCatgac cttcaccaac cccacttcta tctccaaacg tctttctgag 
12180 9 " at 9Cagatgtat tct gccacca tcacaagggc taaccgagcc ctggctgcgg 
12240° att9 ttgttcacat tatttccatt cttacaccct acttcatgtt tgtacactat . 
12300 taCat ttgCtgtCtc ttctaaacat tctttgctgc atccactttt tctctatttg 
12360 ta99t gCtgcagagg ct aatgctgg gtttcctttc attcctcctt gcactcagca 
12420° ttCt CaattCcttt tgccatgtct ccactttaaa tcttaaccta ctccagatag 
12480 tCCtt CaCa ° tattg gcatctgtgc ttgggttgct ttcagtctat tctctgatct 
12540 ttCtt tgCatgatca agaaggtgcc atgaaaggat cccttaagaa agcctgtcat 
"eoo" 933 ° gaaCtagct tcatgatagc accaggaaga ctgatatctc ccaggaaaca 
12660 CtCat 9gtggtgCtC tttttgcctt cactatgaag tgtttgtctg cctgtatgtg 
aaaacgagag ggtttaattg taaggatgca gcacagattg ggactggcat cagaaagcca 
12780 9aCt9 aggtagCtct agagaccgct ttctgtctcc agtgctctcc ctcctgggtg 
1284? ttttC tgtCtCCtgg "tctctgct tctctctatg ggcttcttta ttatttgcag 
12900 aat " taccccaaag tc ctagctca tggctcctct ctgcatatat gctttctgtt 
?2960 CCaCa aagCtCtttc tattcttcta gtttaaattt tcaagagaag aaatctgatt 
13020 tt3aC ° tggtCatgt "aagaccac tgaccacata tgagctggtt gccctgtgtc 
13080 CCCCC ttCtCCCaCC ctc ttcccct ccccatctgg tctgtcataa ctgaatgatg 
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gagtgggaaa ttgaaattgc catgggaatt ccatgataag ctatctaaac agttttatct 
13140 

ataagtggta gacagagtca cttagaaggg agtcccaggt gagacaggca cctgtcaact 
13200 

ccaaactggc acacattcta aggtctgcaa caccccagag agagcactga ttttgtagtg 
13260 

gcctgtactg gggcggtagg ctggagaatg ggagaaatag ccacttcaga atcccccagc. 
13320 

ccaaatgcat caagctcact atagactctg cagccacgat tcagctggct tctgctcaga 
13380 

tcaacagaaa acattcttag tgaatgatgc ttgtggcaca tatctcaagg ctaccagggt 
13440 

catttcttcc catttacttt ttctctgatc tatcctctcc aggacactag cgtcagaaga 
13500 

taatcttccg tcgttttcag gtacactatt tgggtactga gtcactttca aagcctcttt 
13560 

ctgggtttgg atttccagag cagcctgtgc tgtaaagcaa gacagaaagc ttccctgcca 
13620 

ttcatgcctg ccagggatag aatgacagta ctcctgaggc tctccctccc cacccctccc 

13680 - ^ 

ctgctggaca gctgatctgc tggactcagc cagagccagc aggcaccccc tctttatcct 

13740 

aggagctgca aacttgatgc ctttccagga aatccccaga agct'ggagta tcctcatcta 
13800 

catgtggcac agtgtatggt tgtgtcaggt gctcatgtcc cattgcatag gactggggtg 
13860 

gaaaataggg accgtccttt tgtgtcagct ccagtcaatg agtagtggcc atccaggggg 
13920 

ccatcttgga aaggacttgt gaggctgtat ctgcgctcag ttgtagatgt gagaagaaaa 
13980 

ggccaaatat ctgccaatcc tagtcctggg attcaagata gaaagaactg catggagtga 
14040 

agaaactagg agtctccatt tcactgagat gcataagaat gaaattattg tcactatttc 
14100 

ttcaatactg ggccaatcct aataagaaaa ccctttttga gtctctcttt tctttatcct 
14160 

acatataaca cagaagcttt ttctattccc tggatgaacc cacagggaca gaaattcttg 
14220 

ttggacaggt gaagcagata atttctttat cagactagaa tcttccagaa gcactgctaa 
14280 

cctagtgagt tttgtactct agacaggtgg ttctcaagcc agctccccac cgcaggcctt 
14340 

tttcatggtc tgcccctccc tgtggaaccc atgttttagg ttattagctg ataattggat 
14400 

ttctattttt tctcataaaa tacagcaaaa gatagctagt gatattatga tgagttaatg 
14460 

taattatagc caaagcagag agaaacaaca ttttaattaa cctgtgtgga ctgctggaag 
14520 

aatataaact ttctattttg ggggttgagt agagacagaa atgaacacag ccaagggctg 
14580 

actgtcagag gacatttaac tgatgtaaaa tgctttgaaa ttattgggca ctcattgttt 
14640 

aaagttgttt ttgatgatgg taactccgta aggggatcag aacatgctgg aaagaatggg 
14700 

cacagctttg gttacctggg ccttaccact gttattcagg cctctgagaa agcttactat 
147 60 

tgttgttatg tttcttacat aataaaactt ctaatatttg tatgaaaaca tagaattcca 
14820 

cttttaaaga tgtaaggatt ttgtcatacc attagggtta ctatgatcac ttgattctag 
14880 
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gtctaagaaa tattaagtaa tttacccgcg 
14 940 

ccttgatctt ctaataccac atattctcac 
15000 

aaaaaggtag agagtagaat tgtagttatt 
15060 

ggttaatgga tacaatgtga agttatgtaa 
15120 

gtagggtgaa tatggttaac agtaatttag 
15180 

tgaatattca caaagaaatg ataaatattc 
15240 

ccagcatttt gggaggccga ggtggatgga 
15300 

tggacaacat ggtgaaaccc cgtctctact 
15360 

gcgcacacct gtagtcctag ctacttaaga 
15420 

gaggcagagg ttgcagtgag ccgagatcac 
15480 

gacactctgt ctcaaaaaaa aaaaaaaaaa 
15540 

ctaattaccc tgatttgatc attacacttt 
15600 

ccatacatat gtataattat tatgtgtcaa 
15660 

tgtatttaca aaacatatgt aaccattaag 
15720 

gataaaatta cagtataaga ttgtgcttga 
15780 

atgagaaaaa gtgcaaagaa ctaagtccta 
15840 

cttctctggc cttttgcttt gaaagatttt 
15900 

cagccattct cacttgtgcg ttggctttgg 
15960 

agtggaaagg ctgcatgttg tattaatgca 
16020 

tgggaagtga gactgacggt ccgaatggtg 
16080 

ctttgtgcac ttttaaatgt ttaacatgag 
16140 

aaaattttag ccetagatct tctgatttta 
16200 

ttttagcatg agaataaggt tatgaagctg 
16260 

aggaagttaa gaatgcattc tacgtttctt 
16320 

ataggggtaa ctgggcccag ggagtttggc 
16380 

tcagaaatct ggattcacca tccttgactg 
16440 

tctgcagagg attcttgttt gtgtcacttc 
16500 

tggcctgctg cctgggtaac agggcgaggc 
16560 

acctcctttg atgccatgtc catggtgtcc 
16620 ' 
gcagtgatga gatgagagca ggaggggctc 
16680 



aacacagagt tttaagggta agtatcaaaa 
tcatatgtgg gagctaaaaa tattgagctc 
agaggatgcg aaggaggata gggagaggtt 
gaggagtaag ttctagtgtt ttgtagcact 
tgtatattta aaaaaaaata gacaggattc 
agctgggcgt ggttgctcac gcctatattc 
tcacctaagg tcaggagttt gagatcagcc 
ataaatacaa aaaattagct gggcatggtg 
ggctgaggca ggagaatcgc ttgaacctgg 
gccactgcac tccagcctgg gtgacagagt 
gaatgataaa tatttaaggt gatagatatg 
gtatacatgt gtcaaaatat cactctgtat 
ctaaaaataa aaggaaaaaa atcatttcag 
aataatgttt taaattatat ctaagggtgt 
aaaagtgcaa taagaagtaa atatgtacag 
agcagactat acctttccta ctgcatggta 
gcacccagca tggcaagtgg ttagcagagg 
gagccatata tgttgttcag ctgggtgtgg 
ttgttaagaa cctctaagag tgatttcttt 
gaaagacaac ttttaatctt ttactttaca 
catgcatttc tttaataata aaaatacaaa 
aactgcatat tctttctatt gtgttacata 
gaagtagcag gctccctttt cctcatatgt 
ctttaaggag ttggcttctt tccttttaac 
aagggccaaa taaagtcctt aatgcccagc 
ctggctccaa cccaccctca cctgagctgg 
atcaccagca actaccgaca gatgatgctt 
tggctcagga ccatgttttc agatcagggg 
gagggcagcc aggatcaagg gctagacggg 
agctgcagcc ccaggagagc ctatgccagc 
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cctgttgacc aaggaggaca gaagcaacag gagagcggag gcagaggggt gagtgtctat 
16740 

cgctcaatgt ataatcggca gacatttggg gagctcatac tgtgggctaa gcacagggaa 
16800 

gaaaggcaca gtccctgtcc tcagggaggt cacagttgat agggaagaca agcatatgtg 
16860 

ctagctgcta tagaaggggg aaccactgag ggctgtggcc acacagaggc aacaccccct- 
16920 

tcttgttttt ttgtcaggga ttcagtttgg cgtcattaga agtgacttgc acaaccccct 
16980 

cctccagtca attcagaagg acttgttaag caggaatgat gaattagctt cagcttgtgg 
17040 

ggcacacaca gatggaagta taaggtggcc tcaggagtaa gtaaatcccc atgcaagctg 
17100 

tgtccttaga ccagagcagc acccggttct tccccatttc tagtaaaggt gcctcacaca 
17160 

ccaccaggac acaaittatg cctgcagaat gaatgaatga atgaatgagt gaattcctgg 
17220 

aacctcttct get tat ji ;z cncaccaggt tgeagcaage ccagggacac ctgggactgg 
17280 

aattgggctc tca^t:;tdd ggjccaggga gcacccacca ttttgeatte ttcagccctt 
17340 

cctcctctcc tgtcrc* v rcag:<iatat ccacagagcc ctctgagcaa ctctgagcct 
17400 

ctccacagcc tgao.;^_: ir c'.-iiKdCca gctcttcaga gggtgtttct gtgetgetea 
17460 

gctacctctg ageetc nci gectttqatg ctcaggagac accctgtaat tcaattaagc 
17520 

cttctctcca gggagcatgt. aattatgtcc tatctgggee ttgtaatgac agccccctgc 
17580 

cactctacag ggagttrjccc tgctcagctg cccagaacct ttccctggga ggaaactaat 
17640 

ctgcttagcc cagattgg.nc gcdgttctgc acagcacttt tccgaatgcc tctgaaatga 
17700 

gtcctcactg acagaac.iqg cccactctgg gggaactgag ggctctcttg gtcctgcact 
17760 

gctctttgcc atacagatct- gtctgcccag gatttttctt gggtgtgtag gaggctgaga 
17820 

gagctcccct ttcttctcat ggctaaatcc cttggtcttt ccagccctcc tgggggttag 
17880 

aagggagagg gaaaaaaaaa aagactgaac ttgttgttgt - tgtttttgtt gttgttgttg 
17940 

tttgcctgtt ttctatgttg tcttgtgggg agagggtata agattgattg acagagtggc 
18000 

acacttcccc tgeaaattea tcatttgaat ttctcaggta agatgttcac atttctctgt 
18060 

taagatgetc caatttctct ggttaagatt tctctggtaa gatgetcatg aattggtgga 
18120 

ggtgttggcg ggatgtggga agtgtgcctg ctctttctga gttttggggg aagttgcctt 
18180 

aattctctgc atgactttct ttgctccttt gggcttcatt tctgtgcaat gtagtctgac 
18240 

atgaatactg ctcagggagg tgttgcttcc cactgcccac gccactggaa accagtagcc 
18300 

caggtttact cgagtcctcc ttttgaggaa cccaaattct ttcatttctt ttatgtgaga 
18360 

tctgcccaaa atgccattgg caagctgtac tgggttgaat agtgtccttc ctcctcccaa 
18420 

atgtatgtct actccaaacc acaggatact accttatttg ggaatagggc ttttgcaggt 
18480 
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gtaaccatta atagttatga tgaggttata ctagattaga atgggcccta gatcctatqa 
18540 

ctggtatcct tacaagaagg ccatgtgatg acaaagacaa agaatggagt gaggcaccca 
18 600 

aggaactcca aggattgcta ggaaacacca gaagcttgga ggaaggcatg gaacagattc 
18 660 

tcctctcgga cctctagaag gaatcagtcc tgctgatacc ttgattttgg acttctaqcc 
18720 " 

tccagacctg ttggggagaa tacatttcta ctgttttaag ctaccacgtt tgtgqcqatt 
18780 

tgtcacagca gccataggaa actaatacat acaacctgca caatgcctac tccagcattc 
18840 

catagcaagt caagggcctc acaattatgt ccaaaggact gatagaagag cgacctctqt 
18 900 ~ 

gctacttgtc cctcaggacg ctgacccaca gctctcaagg caggagtagg ccagagctca 
18 960 " " 

ttcaacaact ttgttatata ggggttccaa ttgtaaacct tttgaattcc tgtttgcaaq 
19020 y 

tagatgaggg ttgaaaaata aatggccact ttctctaagc cacatacccc aatctgtttt 
19080 

gttacttcat tacagctgtt ataatggcct cctcttctat cttccaatct ccataqccct 
19140 

ggttccttga tagttctttt tttttttttt tctttttttg aggcggagtc tcgcactqtc 
19200 

gcctgggctg gagtgcagtg gcacgatctc ggctcactgc cacctctgcc tcccaggttc 
19260 

aagcaagtct cctgcctcag ccacctgagt agctgggatt acaggcacct gccaccatqc 
19320 

ctggccaatt ttttgtactt ttagcagagg tggggtttca ccatgttggc caggctggtc 
19380 yy 

19440° tCCt gaCCtcgtga tccacccacc tcagcctctc aaagtgcggg gattacaggc 

atgagctacc gcgcctggcc agatagttct taaacaactg cccagaagtt ccagcctaqq 
19500 

19560 9CagC Catgaactgc att 9 ct catt tctgcttttt gaccttttcg atggctgaac 

tctaggccat ggaaaacaag gacccactgt atagttaaga gtcattttgt qactaqqqaq 
19620 ~ y 

19680 aaa " gCCtattctc caaatcccct ttccctctgg agttcctcgg tgccttaaag 

cttgtcctga gctacaggtg tgttacctgc ttatcccaaa atgcaggcat gttacctqct 
19740 y 

ttcctctgca aagagaggca ggcctggctg gggcacagct gaagatgtca aggccaacct 
19800 

aagggcagcc aagctatggc tgtctgtgac aagaggagag cagcggtgat gqgagggtaq 
19860 

9^ggcattga gttcatgtcc gggtttgcct cctac.cctcc tatcactgct tgatgatcct 

atcactgtct tgatgagttc aagacagaag tttgcctcat cattgccaca ataaaatcac 
19980 

caataacaga agtgtgaaag cagcgatgtg agtggaagcc catatataca caggqqqtaa 
20040 

tagagcagca tgattaaata tgtggccttg ttatcagaca ggctgatttg gagtcccagc 

tacttgttgg tgacctgaac tagaggaagt tatctaacct ttcattttac tcatttacat 
20160 

aacatggcta ataatagcac ctaccttata gggttattgt gaggattgaa tacaattatq 
20220 

caatataaaa cgtttagcat agtgcctagt ctaaattcct caccaggggt atgatgtact- 
20280 
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agtttttagt taagtaatta 
20340 

agcgttttgc tcaggttccc 
20400 

ttttaggaag aaacagttga 
20460 

gagggaaatg tcattatctg 
20520 

ccaaagacag agcagcccca 
20580 

tcccgatgtg gacgttatgg 
20640 

aaagtgaagg agaataaaag 
20700 

ctttctggtc tcaaggataa 
20760 

agcagctaca ataattacag 
20820 

ctgtaggcag tactgaggcc 
20880 

accttccacc cccgggcctg 
20940 

gagtggaacc atatctctgg 
21000 

gcttcacatg gagggccctg 
21060 

tcccatagtg ctgtggatgg 
21120 

taactaagga attaactatt 
21180 

ctttcttttt catccctata 
21240 

aggcagtttc ctgtaaccta 
21300 

tgtgtgccag gaattccagc 
21360 

ctgcaatggt gcaaccccag 
214 20 

tcactaaatc caagtcagga 
21480 

aatttgccat ggttgaggca 
21540 

tacttgtcat attcacgccc 
21600 

tctttcctat agaacacgtt 
21660 

ccactggcta agctttgtgc 
21720 

tacatcaaaa atataataat 
21780 

ccagcattct taccttgtcg 
21840 

tcctgagcag aaggagataa 
21900 

actgccacca tccaaggctg 
21960 

gtatctaaat gtcttctgta 
22020 

cctctcccat gccaactaaa 
22080 



gtatcctgga catgtcacag 
ccagcagtaa ttgtattccc 
tctaaagata gaaagtcaga 
ggttttcttt gatgatgtca 
ggacagggaa gaaggtgacc 
acttgttttg gagatgaagg 
aagtgggagg aagaagggct 
ttacatgtga aatcacttgc 
tacaggctgc agagggctct 
tctctcacta gacccatctc 
ttaatgatca aaaagcttaa 
gctcctgtga tgaaaaccac 
ctcttaatgg ccaagtgatc 
tgctgcctct ccccacgcat 
ctccagcctg attctgcttt 
tttggagatg agtcaccctt 
atgaagtgcc aaacaatact 
aggtgttgga catttatgat 
ccctgaccac tctgtgcttc 
attccaggtt aggcttgagt 
tgattgcaat cactgacaac 
tgatcacggc cccactcgca 
gctgctgccc tgttctggtc 
ttgcacatag ctggttgaat 
aatatcggca attttatgtg 
ctttgtaaac cctgctgctc 
aaggctggct cacaccccca 
aagagacttc ctttgtgtta 
ttcctttcct caaaagaaaa 
caagcaagca agcaaacaac 

16 



ccatttgacc tatctgggcc 
tccccaatcc cgggattagc 
gtactgtctg gaggaaggta 
gggaacatga caggctgctc- 
ttgaggttga ctcctctgca 
gaaagaaaga tggaatgtag 
gggaggagga tgggcaaagt 
cagtgggact ctggggctgg 
tgggcatgtc ttggagcagc 
ccagatcaca tagtacacac 
acagaacaat tacagcttca 
aagcctgtca ggctggggct 
tggagcaaga cccgtgactc 
ccccagaaga ggaagttcag 
tcccaatcag ggctttatac 
gccttcattt tacctaagca 
gtgatttatt tagtacttac 
gtatgatcct t'acactaagc 
ccttttcaca acacagcttg 
tgtgcagagc ccttaactga 
tcctcccggc tctacacacc 
tctcttccca ctttagaagt 
actgatcagc cctggcctaa 
cgtatgtatt gctgtttgtg 
tttcattcaa catgagggac 
tcaaatctcc actagctgtt 
tgtttttact ggtcacagtt 
gggctaaaac cttagtcatt 
aagtaccctc ttctgccaac 
aaagaaaagg tgatattaca 
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gatgctgctc agcctatgat ggggttacat cctgataaac ccatcacaag ggatgtaatt 
22140 ^ 

ccattgcaag ttacaaatac cataagtcaa aaatgtattt atttcatata acccacagaa 
22200 

cgtgatagct tagcttagcc tacttgatca tgttcagaag acttatattc gtctacaaqt 

22260 . - - ^ ^ 

ggacaaaaac atataaaaca aagcctattt taaaataagg tgttgaatat ctcatataat 
22320 

ttattgaata ttgtactgaa agtgaaaaat agaatggttt tctggatact caaaqtataq 
22380 

22440 aCtQa atgcatatca ctttt 9 cacc atcataaact tcaaaaattg tcggtcgaac 

cttcctgagt caggaatcct gtctgtacag ggtataaagg aggaaagcat cagctttqqa 
22500 

ggcaggtgga cctgtgtttg aaccctgatt ctgctagagc ttgacaatgc atattcgttt 
225 60 

tctattgcat aactaattac tacaaacaac acatttattt ctcagttttc atgaatcatq 

22620 y 

agtccaggca caattr.vgri gcagttaagg tgttagctgg ggctgctgtc ttatctgaag 
22680 

catgggggtg ggg^tgtLM ^tccaa-jgtc aggtggttgt tggcaaaatt aattttcttg 
22740 

cagctataga actcatnirt tgctt-ttca aggacacggg gagagagaat ctctcacatc 
22800 

ttttaaaggg ttcacctgnt laj-jt-caggt ccactcagga cagtttccct taaagtcaaq 
22860 

gcttaatagt caactgntta q^jdccctaa ttatatctgc aaaatacctt caccattgcc 
22920 

atgtaacata atcatggcaa ataa:: d cag gtcccaaatg ttcacaggtc ccactcacac 
22980 

ttgagggagg ggattat_atd gggcatgttc ttgcggagag aaggaatctt acagccacat 
23040 

tggaatctgt cttccatgct atttgacctc aggcaaattg actaatctct tgaaqqttca 
23100 

atttccttac ctggaataaa aggacaataa gatcagccat ataaggctat gacaaaqact 
23160 

aaatgagata gaataggctg gaaaagtctt gcagatagca gacacaagta tataacaatt 
23220 

tccctcctac tgttcctttt gtttttcacc tatcctgcag tctctgtcac ttcaaatacc 
23280 

atagaaaacc tttccaagca gcccaaatca tgcccccaaa tagtcacgtc tcattattca 
23340 

tagcagttat gttccataaa gttagcacaa actccgaatg agtgaatcct aaagcqttqc 
23400 * y 

tcctggagga aatacaggct gctggtcaca atatttttat caactgatca atatatacct 
234 60 

tgtcttatgt gtgtttctgc ttcaagacac tttatttaat atatacgttg attcattaac 
23520 

tctgaactct ctaggcaaca gcattataac tcctgccttc acaaagctta tctaacacac 
23580 

acatttcctc ctcaggcaca tcccagcctt cttgcactta ggattcagca gtatqcttaa 
23640 

gggccatttt caacagcaaa ctcatcagcg caaacacaaa catgtgaaaa acgtagcact 
23700 

aaagagactg caaaaaggac actggcttac agcatggaag ctggaaggag aaggcagaga 
237 60 

atcaccttgt tccacttcag ctatgaatat gcagtcaggc cacccagtca ttcaaatttt 
23820 

ataaatatac tctaatatat atataaatac caggcagggt tatttttttc ctcaaqtcat 
23880 
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ttttctaatt ttttttaaat gaatagatag aagagctgaa gtaagggtca ggagcaagag 
23940 

ctctgcttcc ttttcccttg ctgggcttcg ttagagagcc atcatctcct caatatgtct 
24000 

cccaactctt ctaggcattg gatgagtttg ctgcagatac gaaacccaac tttgccagtc 
24 060 

acttcatact aacaggtgaa atgtagtgga ggagcctttt gaagacaggg actcagcccc. 
24120 ^ ~ ~ . 

ccattagcct cattgcagac ctagattcct gccaaaatta atttggctgg aacttcccag 
24 ISO 

ccatggcatt gtcgacatta cacatcttcc actgtaatgt caattaccat tttattcagc 
24240 

cgaatgctgg agagttaatg ttcaagtggt tagagctggc tacgggtggg ctgaacaaga 
24 300 

tgtcttttcc ttcatttccc ctgcctgtgg tgaaggattg taaccagccc tggctggcag 
24360 

cactttgaag ctcacccaga gtgctcctgg ggacatcttc tacagagcct atcatttgga 
24 4 20 

catgctgtct tctgggcctg tcttccttcc ttccttcttc cctccctccc tccctctttt 
24480 

ccttccttcc ttccttcctt ctttccttcc atctgcttta aaaccagctg ccttgagtgc 
24540 

ttgtcttggc gcccctcatt agtgccattg caatcatccc tcctgcctac cctgctaacc 
24 600 

acagcttgtt agtccacaac agcaacagct gtgtgctggg gtgcagcagc tggagggcca 
24 660 

aaggtagggc tgggggacag ggtgttggga tggttttctg gggcagatga gtttat'acgt 
24720 

ttctttcatg tccccttcct cccacataga cttttatttc cccaaaggaa aacagaaaac 
24780 

aatgatctgt ttgacagtgt tgctatcatt gggcatcaaa cctatcatct aaggggaatc 
24840 

cccctgtata atcagtcagc caaatggagc aggaccctgt gttttgtagc tgatacaaca 
24 900 

gggcagcatc tctagtgagg gggccagggc ttctatttcc ttcattaaaa aatgaaacag 
24 960 

cagacctgat tccatattta gagattacac ttagttgcca ctgtgggtgt gcaggcacca 
25020 

accaaaccca gttggcaccg ttgtcttttc tctgcaatga tgtattgaat ttaataatgg 
25080 

aggtatatga aattcagagt gattggaact gaaggtttag gggctttgtg taaaattgat 
25140 

atgtaaggga tttggaagta ggtgagggat tcttccccaa tacttattca attttggagt 
25200 

caaataacca agcatttaca aatagccaaa aaagaaattg aaagagggtt taatccaata 
25260 

aattttcatg cctcatatga accacatctt ataataagaa ttatgctttt tcatttcata 
25320 

ctcagttaac aaatatgatt tgtgagcacc tggtaagttc agggcactag gctgaaaggg 
25380 

gttaccaaat gtcttcattt aacaaagtcc agctgagctc ttacaggtac cagaactgtg 
25440 

cctgggctgt catatgaaga tgaatgtaag agtgtgtcag gccttcaaga gcttacagtg 
25500 " . 

tgtcaggaga catcaaacaa gtgagccaat aaaatgatac tgccatttta gaaatagcct 
25560 

gaaattcatg gagttcacag tcttgttagg aaagtgaaac ataaacctat aagcattaaa 
25620 

aaataactgt tgaagacagt aacggaagaa tgcaactggc aactgaatga tataggttgt 
25680 
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25740 Ct9tt aaatatqatg aaaagagacc atgatgagct gaggcactcc aagagacttc 

tttttggaga tatgtttgga gccaaatctt gaagatttaa ttgctttttt cttttttttt 
2 5800 

tttttaggtg gagtctcgct ctgttgccca ggctggaagt gcagtggcat gatctctgct 
2 58 60 

25920 CaaCC tCtgCctcca ggttcaagcg attctcctgc ctcggcctcc tgagtagctg 

25980 aCa " Cgtgtgccac ca tacccagc tgatttttgt atttctagta gagatggggt 

26040 CCt9t t9gCCaagct gg tctc aaac tcctgacctc aagtgatcta ctcgccttgg 

ccttccaaag tgctgggatt acaggcatga gcactgtgcc tggccttttt tttttttttt 
2 6100 

ttaaaaaaaa aaaaaaaaaa aacaggaagt tttcgttagt ttttttgttt gttttacttc 
2 6160 

o^^ aaaa ° t^tt^^tc acatggaggt gaatggaaag agaggctgtg gcaacagacg 
26220 ' 

ggagactttt ctgatatcag aacccagtcc catagaccag aatgtatgct ttcaatccac 
26280 

gttgtctggg tccatcctat tgagtgccct gcccccacag cggggtatgg agaagagtca 
26340 

26400 a9CCC CagtCCtcaC g ta g ct caca atccagtgga ggagacggac tcagaaacag 

atagagatga agccatgaga tcagtactgt ccgaggccat ggccacggtt ttgtgggaac 
2 64 60 

ccacgagagg gaatgactaa ctgtggggaa gaagagggag aggaccaaaa tgcaggggaa 
2 6520 

gtgctcacag aggataagta agcagtgagg tgccatgaaa tgagtataca cctgacagcc 
2 6580 

?H taaCa9C tca g a 9 cct g ggtagagggg aatagagctg ctggttctct ggggggaaga 
2664 0 

gaggggtatg ggattctgga acagaagcac caaaaccagc aggttattgg agctgttagt 
2 6700 

gctcagatca gcaatgggtg cacaaccaaa ccattctcct agggatgagt tctttcctgt 
2 67 60 

??fo« a " gC ttctca gcct ggcttctccc gagaattacc cgggaagctt gaaaagtact 
2 6820 

gatgcctgga acctacctcc agagagttgg atttcattgt gttgacgtgg ggctgggata 
26880 

o^ 9 ^ atatt gtttaa g<=ac tccaggtgat tctgatacgt agctgtgatt gagaaccctt 
26940 

27000 aa9Ct atCCatctgc actccagggg tgctcccagg cccatctgtt tgtaaatgga 

o^fn gtCtt ga gg ta acaa atgtgccaag gctctggagc caagcacgcc tggctcctta 
27060 

27120 taCtt agtgacctca ggcaagttac taaatggctt aaactttaca aatccttaat 

ttgtaaaatg tgggcaatga tagtacctcc tcacaggatt attacgaggt ttacacggaa 
27180 

tactctcagc tcataataag cacttgcaca ggcctcatgg gctaggccct caaaacttaa 
27240 

27300 CtaCa " Caacagcc atatgaaagg aattttatac caccaagtca aaaaatctgt 

gagcactgct cagaagcaaa agcctgtctc caacagcgct catttaaggg gtgggcgagc 
27360 

2742? a9a9a agaatgagcc cc< = a <=agggt aagctgggga aagctgggga cagaatgaga 

ctcaggaaat cacttgaata ttgattatat ttgtgctcaa taataaaata acgaaatgaq 
27480 
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tacagcccta gacctaaaca ttgtgggtga 
27540 

ctgaggaaaa actctaaaac ggtgacttct . 
27600 

atttagtcta agtcaataca tacgacagaa 
27660 

taaaataaga gaaagaataa aaaacccttc 
27720 

ttttagttag gtctcctgga gaggaggttc 
27780 

gcaatcatca aaaggtaatg catggactta 
27840 

tttctgtgtt tgcctttgag aaaatcagcc 
27900 

tccttggtca agattagtgg gaccaagaat 
27960 

acctgatgcc ttattt'caca caaatttgca 
28020 

ggagaactgg tgatcaaatg atgactattt 
28080 

accccacatc ctgctttcac ctgaatctga 
28140 

tttgaaattc cttttctgag ctctgcaaga 
28200 

ctgtctggga gagattggaa acctgtcttt 
28260 

cctccatatc aggcctgctt ctcccattgg 
28320 

cttctgattt ctctctcctg gctcacatgg 
28380 

gctccgtgga acccattggc cagctgggct 
28440 

ctggtgagag ggaggccatt ggagtcactg 
28500 

ctaggtgggt gtcctcttca gagaaacggg 
28560 

gtcagccagg gaggaagaca gagagctgat 
28620 

gaaaggaggg taatattcct aaaatgatag 
28680 

cctgatattc attaaaacta aacgctattt 
28740 

cttcccacta agtctgaggg gagcttgttt 
28800 

cccatgaact cttttgtcct actgaagaga 
28860 

atttatgaac tgctgctttg cagtcccgct 
28920 

tccaaagcca gaggatgtaa atctccctgt 
28980 

cacttcagca agtcacctaa cctgcaaatt 
2904 0 

gctagctgtg agactttgag aaatgtattt 
29100 

caatgggtat aataatgtct accatatatc 
29160 

tgctgaatac atttccaaca gagtctgtgc 
29220 

agctaaccaa ctaacccacc aacaaaccaa 
29280 



ggcaaaggca atgcgttaat tttgcatcca 
tttttaaggg accagaagaa tctagattat 
ccttgccctc tagacttgat aagaaagaag 
caccaaaata ctaacattca gataatgact. 
cctcagaaat gaatagattt ctcttctagt 
agtgtgatcc ccaagagaaa atcaatgacc 
agtctatggt taaattagac atattttttc 
gcagtcttac actccttcta gcaaagaatt 
aagttgtatg gacgttgtat cttattttaa 
caatagtggt tcatttacac caccaccctc 
acgatcatag tcagtctgag attctgaagg 
acagcatctc ccaagagagc tcagggcaga 
tgcagtaaca tgaattggtt gaatggtcac 
gtttctgatc agcccaactt gggtctcacc 
ggctgcactg gccattaggt gccaggcttg 
ctgtgg^gcc ctaaggcagg gctctggtca 
gggtggacct acagacccta gggttaacag 
ttacaaagtg aaagaaagtt acactgtgag 
ataagatagg tactgattcc ctggggatgt 
catttagctt ccagtataca ttaattgatt 
ccttgatgtc tcatccaaag ccgcaccact 
tgttgacaag tgtaagaggt tgaagaggga 
tccacagatg gaaacaaatg ctcctaccac 
tctgctatca tgcacaggaa ctgactaagc 
aataaatgta agtcatttat tagctacata 
tcaagcatgt gaatcttgga tctttcatgt 
aatgtctctt tgcttccttt tctacccaca 
tttgcagcaa ggtctaaatg gggtgataca 
aatgataagc tctttccaaa tgttagttaa 
cctcttagcc aggactgatg gaaggagtct 

20 



3NSOOCID: <WO_0233086A2_I_> 



WO 02/33086 



PCT/US01/32152 



29340 a<3aat g ° atttaaaa cactt 99cac catgcctgac aagagtaagt actcgataaa 

tcagttattg ttattatcgc atcggtatta tgaccattat cctcttctct ataggcttca 
29400 

ggttttcctg tctttttatc acagcagtat tccagcagaa gcctttgatt taactaagtc 
294 60 

tctactgtgt gtgtggctag atgctataaa gcatccagag aagtgagaat ttggtcctgc 
2 9520 

ttttaagtag cttatagtct aattaggggg aagtaatcag atagaaagga aactaacaat 
29580 

atgcaaaagg aaactcatag tttgtggtaa atgccaggtg ctgctgatag tggcttcaga 
2 964 0 

^9700 CtCat agatgctata SSaggtcaaa ggagaagcgt gcagcttgag ctaagttttc 

agggaaaagg gtgaaagaat tagtcattaa tgtacaccta cattacctgc cagactccat 
29760 

tcaaaaatat tcttaccaaa tcatcacaat accttgttgg taggtactat tactatttta 
2 9620 

cagaggagga aagtgaggca aagacacatt aaataatttt cccagaatcc caaggtgtqa 
29880 

29940 39Cad 9gaCacaaat ccat( 39ctct aagtccctcc tagtatatcc tgcaaacaca 
30000 aatta atgcagagag gaaggggaga ggcagtgttc tgcaggagtt cagagccatg 

ataacccttc ttgtgtggct tttggtaagt tattttacct cttaccctct gtttccccat 
30060 

30120 Caat9 aaggttgtat atacacacat tatatggccg ctgtaagtgt gcagtgatat 

gatgcatggg gactcagttc atgaggcagt gtgaattctg aaggtatcac aatgggacag 
30180 y 

gtgttttttt ctccactcat tttctccgaa agtcttttgt tttgttgccc tccctctttq 
30240 

gggcatatgc tttcagctca taccttaatg acatcagaat ctgcaatttc ctggcaactt 
30300 ~ 

ttgtggttaa aattattctg cccttccatt ttaaagcact aatagcaaag gtattaggtg 

30360 * 

caaaatgatg ataaaaataa ttgcaatttt taccattaaa agtcatggca aaaccacaat 
30420 

tactttggca ccagctgaat attttgaaac tccctactct gatgttaacc aagttcatga 
304 8 0 

3054C> a9aaC ttgcagaggg gtaggggaat ttcaagggaa agggggagat gcctggggtt 

gtcacacact ctgtctttca tcctctattg acatgttggt tatttggaga tggtattcag 
30600 

ttccactata gcccctcagt cactgtagac cctctcaaag gggcaatcat gtttccctta 
30660 

ggtcaggtcc attcatctaa cccctctccc gggggcatca ccttgtttgt tccagcaqct 
30720 * 

30780 9CCaa actGaCaCCt cctcctcac c ctctagccct tatgatctgc tttggggagc 

catgggaacc cctagtttcc tctttcatac ccactgagat tcacaagtaa ctaaggtcaa 
30840 

3090o" Ctt CattgCCttt ct g c ^gatac cttacgctac tgttcctcct cgcctggctg 

gctccacact ccagcagacc ttctgctggg cgagaagctg caggcctgaa tctctgtgtt 
30960 

31020 at " C CCCaactctt ggg a ttacac tagctcttgt aagaactcaa tgctctgctc 

tgctcatttt gatgccatca aagagggctt gcaagttacc agctgggagt gaacaccagt 
31080 
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gtcctctttt tagaggtacc cctaatcttt ctgaacaatt ttgctggcac cccttcactt 
31140 

ggctttgccg gggtaagagg gggcacttct ctcctttccc tcatgaaagg agggagagaa 
31200 

gccaaaaatc tccctactag tcaacaactc aggcacccct ccttctctcc tctattttat 
31260 

agactgggaa gggagtgatg gttgttggag gtggcagagc cagttcagct gccttttgtg. 
31320 

aagtcctgaa ggaggtgtct atcctcaact gctggcttct gtccttaagc ctggggagaa 
31380 

ttaagtcctc tttgcctcag tttggcactc caattgccaa cattgggaca gcaggaaaag 
31440 

ttccatccaa catcccatta aatatgtaat gtgtattagc acagcgcctg gcactgggca 
31500 

ggtattttct aagtgatagc caatgcgaag cctactttat tattttcctc tttgcttaac 
31560 

ctacaaggtg tctaagacca tttgtttgtc cacacatagt aagataaaca gcactgagac 
31620 

tgtggtcctt tctgccctgt gtccttatcc cacctgggaa tctggaaagc caagcctaga 
31680 

cacactcgtt ccacaaatgt ttactgaagc ttgttctatt caaagcactg tacagctaca 
317 4 0 

aagaccatct tttctgaact ccaaaccagg ccacatggtt ggaataactt caagtatgga 
31800 

gaccaagaga aaaggtggtt gttgtcagca aagctctgag tccacacctt ccaggaactt 
31860 

atagttgatg caatggtggg agaagtctga acctggattc aatctgcttg attccgatga 
31920 

atggtgcagt aggcagagcc atgagttcag agcaggaaga aaccactggt tcaaagaagc 
31980 

atctgtcaca tcgaagctgc tttatagtct gttgggaagc atgcataata atttattctt 
32040 

tctttctttc ctttggtcaa caaagatttc ttgagtccct actatgtgcc aggtactctt 
32100 

ctaggtactg aagatgcagc agtgaacaaa gaagatacaa tccctgccca gcggagctta 
32160 

cattctagtt atcgaaagtc cctttctcag tggctgctct ctttatttga gaaaccatgg 
32220 

gctgttctcc tcccatccta gggctgctgg ctccacagag gcacacagtc catcaggatg 
32280 

ctctgccagc cacccaccca ctcaagacca agggttacgc tgtcagtgtg agcagggaca 
32340 

ctcccgtctc tgctacctcc tttctcctga aaacaagatc tcagggaaca tctgccatcc 
324 00 

attttccctc cctggggagt gacaggaaag gtgtatggag gagattgagc ggagtgatgg 
32460 

attgaggcac tgtgaaagtg aatcattgcc tgacatggga atgaggagac ttgcttaaag 
32520 

gacaagccat gctaagtcat ccatcgttct cccctaagga ggtgaattga agttcccatt 
32580 

tttcccaggg agccaaatta acaaggtgct gggagatttc caaattagaa aaaaaaaaaa 
32640 

aaaaaggcac caccagctct caaatcagag aggctgttga gttgtttttt ggagcagatc. 
32700 

attgtatttg gcatctaacc ttgaaataga ggagaaagca tggaatttct gctgaaaact 
327 60 

catccttctc tgagcaggtg gtacaaataa gcatcgttgt gttctcagag gcaggaacca 
32820 

catttgcacc ttgataccaa ctacctcaat aaccacagtg ctgaattttc acaaattgcg 
32880 
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aattaggaaa ttgttgctca ttttacaatt 
32940 

cagctacccc agtacttttg aaatatgact 
33000 

gtggctcaca cctgtaatcc cagcactttg 
33060 

aggagttcaa gaccaacatg gtgaaaccct 
33120 

gcatggtggc acatgcctgt aattccagct 
33180 

atcacttgaa cccaggagat ggaggttgca 
3324 0 ^ 

cctgggtgac agagcaagac ttcatctcaa 
33300 

tataatattc tcaatatgaa ctagtagaaa 

333 60 

tggttcccag ttttatctct gactctaatg 
33420 

ttctatactt cataggagat gctccaagac 

334 80 

tataacagga tgggccttaa aagtagcatg 
3354 0 

attttcatgc gtgaaaatgc caaggatgtt 
33600 

gtgagcctgg cttagaacaa ttccatttta 
33660 

agtagaagag caaggtaaga aaacagctta 
33720 

tgttgtgaga aaaataagca ggatgctttg 
33780 

cctctctgat atggtgatgg ttaaagcaaa 
33840 

tgaagggctg gaagaacagt gttttatggg 

33 900 

gcgagtgggc tcattatgtt ctagaacatg 
339 60 

aagatggcac aaaaggtgag tgaggtggac 

34 020 

tctcagaagc cctgcgtgtt ggtttcttgg 
34 080 

tggcgtaaaa caacaaaaat gtattgcctc 
34140 

ggtgctggca gggctgcgct ccctccaaaa 
34200 

cctagcttcc agtgggttgc tagcaatcct 
34260 

gtcacagggc gttgtctttg tgtgtctctg 
34320 

gtgtgtgtct gtgtgtgtcc actctgaggc 
34 380 

ttatttattt cattgataaa cataatagtt 
34440 

gatacatgtg tacagtgtgt gataatcaaa 
34500 

aaacattttc tcatttcttt gattggggac 
34 560 

acaatagatt attgtttact ataatttccc 
34620 

cttctgttga gggtgtactt ttgcacccat 
34680 



tggtttccct caggattcct tttaagtagc 
tgcttataaa aatttgatag gcttggcacg 
ggaggccgat gtggggtgga tcacgaggtc 
gtccctacta aaaatacaaa aactagccag 
gctcgggagg ccaggcagct aggcaggaga 
gtgagccaag atcatgccac tgcactccat 
aaaaaaaaaa aagatatata aacaagtttt 
aaaagcatgt gtttttaggt cttagaggcc 
aggtatagta ttacctacat tgattagccc 
tgctagcttt cttcattcaa taaagagaga 
catttcttct ttcattcact cattcaaaat 
tggtcaacca actcttccca gaccctggct 
atggtccatg ccctcaggca cttgtattct 
aaaagttaaa cagttttagg ttgagatggg 
aacctatgca ggtaggaagg tctggaaagg 
accaaaaaga ccaagaacac atggaacaca 
gaaggactag tacacacaaa ggctgcaaag 
ccaaaaagcg ggtgcagctg gagagggagt 
aggagcctta tcacgcaggc ttacacaggc 
gactaccgta acaaagctcc acatactggg 
acagttctgg aggccagaat tccaaaatcg 
cctgtagagg agaatccttc cttgcctgtc 
gggctgggtg actccagctc tgccttggtt 
acttcacata gccctcttct tcttcttttt 
acagaagttt ttatttattt atttattcat 
atgcatagtt ttggggtaca tgagatattg 
tcagggtgat tggaatatcc attcacctcc 
attataattc ttctagctat tttgaaatat 
tgctgtacta tcgaatacta gaacttattc 
taaccaactt ttctttatgt cctccttccc 
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acttccctta ccagcctctg gtaaccacca atctactctc taccaccatg aaatcaactt 
34740 

ttttttttta tagctctcat atatgagtga gactatgcag tgtttgtctt ctgtgcctgg 
34800 

cttatttcac tcaacataat gacctccagt tctgtccatg ctgctgcaaa tgacaggatc 
34 860 

ttatttattt ttttatggct aaatggtatt ccattttgta tgtatatcat atcttcttta- 
34 920 

tccattcatc cactgatgca tatttaggtt gattccatat cttggctatt gtgaatagtg 
34 980 

ctccaataac catggaagtg aaaatatctc ttcaacatac tgatttcctt tcttttggat 
35040 

atatacccag tggtaggatt gctagatcat atggcagttc taactttaga ttttaaagga 
35100 

acctccatac tttttttcca tggtggctgt attacttaca ttcccaccaa cagcatatgg 
35160 . 
tcatctcctt tctccacatc cttgccagaa tttgttatat tttgtctttt* tgataatagc 
35220 

cattctgact ggggtaagat gatatatcac tgtagttttg atttgcattt cccttataat 
35280 

tagtgatgtt gagcattttt ttatatacct gttggccatt tatatgtctt cttttgagaa 
35340 

atgtctattc aggtcttctg cccattttta agtggattat ttgttttttt gctactgagt 
354 00 

tcttcgagtt tcttatatat tctgatacac agccatcttc ttatgaggac tccagttata 
35460 

tacgattaga gaggtccacc ctttttcaga atgaaattat agcttaacta attacatctg 
35520 

tagtaactct atttccaagt aaggtcacat tctgaggtac aagggtttag gacttcaaca 
35580 

tatgaattcc agtgggacac agctcaacac atgacaccat ggtagggaac tttattctac 
3564ff 

ttgcaagttc tgagtgtctt acgcaggtag atggactggt gtgatgtatg ctttaaagac 
35700 

cgctgtgtga agatggcctt agggtgatga ggatggaagt tggagactaa taaaggacta 
35760 

agaaaatgct aagaaaatcc aggtgagagg tgatgatggc agaactaagg tgatagcagt 
35820 

agagagaaga gaagtggatg gagattagac atcttttgca gaacgaatga caaaataccc 
35880 

ctatggattg gacatgggat gaggaaaagg aaggacttga gggtggtgtc taggcttttt 
35940 

actttaatcg tgaagggaag ctggtgccat ttaccttgtt cggacaaacc tggagaggat 
36000 

caggttaggg actgcgagtg gtatggacgg caaaggaatg ggaagaatgc agggattaaa 
36060 

aattggaaat ccccctcccc agtcaacaat atcttacttt tatctgaaaa atactaagta 
36120 

aaaaagcatc cttttgttgg aaagctcaat ccttgttaaa atgaagacat ctctgggaga 
36180 

ggaaacatag tgagcacctt tcccaaaagc agccactgat ttggagatga gacagagtag 
36240 

catacaggac atcagagaga acatgctcag gacagaaaga gcaatgtagg acaaggcagt 
36300 

gtcttggcat cacagtcttt cctccgactg gctgtgagca agtgctcaat ttaattccat 
36360 

ctcagtgctg ggtcaggaca agtgcccaaa agcaaattga caaaagtacc agcatgatgg 
36420 

agttagaagg tagcaagttc cctccacaga gcccagctgg aaaggaagat agaggggaag 
36480 
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ttgacccctg gggatgggga atagggtgag aggagaacat gaaactgaga aaagggcttt 
36540 

gagtgaaatc taggctaaaa gctaaggttt ctttagaaac ccaccattga cccaacatga 
36600 * y 

ccagggcttt ctcttgactt gattattttt gataccccat cttcttctgt attcctggaa 
36660 

ctagctctcc caagccccag aattgtgctt ctatcagagc tgggttttca tcagagtctc 
36720 

ccctttatcc tgtatctctg ttgccctatt ttgtttgaat tcctgccagg tcagctgaat 
367 80 

ttgggcattt ggggtgaaaa accatcaagt gtggcatcct ggctttggca cctggcacaq 
3684 0 y 

tgtgacccca ctggtctctc cctcacattt gctgtggtcc gtgcacggaa tttgtcaaaa 
36900 

gacctcctca gtatcagctt tcctgcagcc tcaatgcacc ttgttctgaa taggatatta 
36960 

37020 C3aga gtatattagg 9 catt ttcct atgccagaag gggtccttag gcctcttgca 

gttttttctg ggtgacagtg aaggaggagg tggctgcaga gcttactgcc tgtggactqa 
37080 

37140 CCa " 9CCtggtgtc aggaccattt gtccagcctg ttgagtgaag gtcattctgc 

ctaaactgta agcacaagag agagttcagc atcatttgca tcctatttta ttgtctttct 
37200 

37260 tttCt ttcaaggcct ca tttttttt ggcttgaaca aatggtaaag gccattttat 
37320 9taCC aagCCaaact ttcctt ggtt ttgtggccat cctgctgggg aaggaagtac 

tcctttactt taaataactt taaaaacatc tgtttggtct caggggctgc agctggaaag 
37380 

attttctaac taatacttgt tttatggggg tgtttttggg ggggtttatt gagtgtcaaa 
374 4 0 

37500 Cagta aattagaatc agaagacaac agttagtgat aagcagagaa gccaaggatg 

ttaccatagg caggcagcag agagagggga attggtggct ggccccccaa aaacagattt 
37560 

gaagatctcc ttctgtcatg tagtgaatcc ccaagtgcct . agggtgggct gtgattactt 
37 620 

gagctcctgt ctccactgtc tcagctcact tgccttgggg tggacacaca acacacattt 
37680 

gctcatagca tcaggtattc aggagcaaag agctgaattt atctggttaa tttagatacc 
37740 

cctaccccct cttttaacac cagattgcca ggatcatgac ctcaaaaggc taccctgaaa 
37800 

37860 ttgaC aaatgggatg aa agatttcc cgtttcatcc acatttgcct cctgagctac 
37920 9Ca9C aggtCaccgc agccagagcc cacctgcttg cccaccatgc ccgcacacag 

acaatgctgc ttctgtggct ggaggtcgga acacctcagc actatctcag tttggctqca 
37980 

gatcctctgt gtgcttggta aacaggtttc ctcatctgta aaatgaattg gctcttccac 

3804 0 ( 

aactttttta aaagcactaa catattagga ctctcactaa atactcaaat gctaaactca 
38100 

38160 taaaa gagtgcaaag ggatgggctc ccaaatatta cagtgaaggc tgcagcattt 

tctgaccttg ctgctttttc tggtgagtgg cttttatttc ttagtttggt ttcttctctc 
38220 

ccattctaat caagcaagaa gtgaccacca aaaggggcac tcaccaaacc agaacaaqct 
38280 * 
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agttctttca tctttaattc attgcaacca aacagatgcc acagaaagag ccaagggctc 
38340 

caggctttag ctccagcctt gccattaact acatatgtaa gtcagccatg ctggtctgca 
33400 

ggttcttgct ttgcatgatc aagggacaac ttggaaggtc tccaatcact ctattccccc 
38460 

agatggaaat gtattcactt atttcctgga gatgtctgtc ctcctcccag ttaaagacag. 
38520 

accttgaccc acctccactt ccttctctgt ggccctgtct tatctgtcct cttgttcttg 
38580 

cctcttcaat tgttctctca ccgtgtttgt cacttctgag ctatcactgt gatccccctg 
38640 

attgtttttc taatgtccct gaacttcaac ctgattttca cgcatataca atgtcttcct 
38700 

aaacacttat agactctgac acattctgta actgacacat ttccctttat caaatgcaat 
38760 

ctaagaagct cacagtttct ctcagtttca acaagagaaa tcaggagcac ttgaattata 
38820 

caacttgaca ttattaiggc tgatqtctga ttttgtcctg tctgcccctg tcatttctgt 
38880 

actacctttt aczc\adccnc tcrtatgacc tgtgtcctcc tccagctcca tttgagaaca 
38940 

cctgctgtat accctgt^n:; ctrouitta ttatgttcgc ctcaatgatg aagaaacagg 
39000 

cttggaagtt aaatui-t.i cc j :*Qgccc acagcctgga acctaggatt ccaaccaaac 
39060 

cttgtctgat tctaaac;c;it o^jj }.*agct ccatactctg cctccctctt ctacatcatt 
39120 

tcagtttctt cactttcjca crtcc^attc tcacccaaac tgaatgtctc acagtctctg 
39180 

tgcccccact ttgctccatc ccttggcctt ctgcagtcca agctccattc tgagatcatc 
39240 

caaggcttct cttctgtgtt gatccttggc cttcttggag tctctttctc ccatgttctc 
39300 

cacaacagag cattctcctg actgttttca ttctgcatct cactctttca tcagtatctt 
39360 

tttctctacc atgccccata aatttgggtg ctcctgaggg tcctgtcctt gtcccctgct 
39420 

ttcttgttgt acaacctcct tgatctactt catctactca agtttggtcc acaatttcta 
39480 

taftgtgaag attcaaatct gcatctctag ccatatatcc atttgcctgc taggcatttc 
39540 

tacctgaata ttttataggc atgccagtgg ctcttactct atggctctta ctctaagtct 
39600 

agactacagc agaaagcaat gctcttttta ttaaggcata gtgcctcttt cagaataatt 
39660 

tacagcatac aaccaggcct gctgtgcagc attacaattt gtcattaaaa ctccattcct 
39720 

cttgccagag taaatgagcc atttacagcc agggcgccaa gatggactgt tgttattttt 
39780 

tctgcctttg tattatgagt .attcatggct ctcctcagac aagctcctgg ggattcccag 
39840 

tggagttgcc ttaacatgca ggtcaattag ccaggctcaa gggtagtttc ctggatattg 
39900 

gtatccccct tgcagaggac tgcaggaaag ctgaacagtg ttcccccaat gtgggtggtg 
39960 

atcctgagaa atatcatttg tatctgcatg tgctgtctca cacacactag ctcacatgtg 
4 0020 

cacacacacg tgcatgcaca ggacaaaacc aaacacaggg caacccagca tctgcccccc 
4 0080 
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agccatcagc attgttacac ctttataqgg 
4 0140 

tcaggtgagt tgagaaaagt tattaaatct 
40200 

tctaaaatgc atgcataggc gggctcagta 
4 02 60 

tcctaccgat aatctttcag atctcagaat 
40320 

gacacagacg aagcagagaa cagtagaata 
40380 

agagagtgag aagattcact ggacagctag 
40440 

aaatttgccc taaggcctgc aggcaatagc 
4 0500 

ctcctccaac acatactgcc ctggcagqta 
4 0560 

ttgagctatg caaaaaatac tggattctgc 
4 0620 

aaagcacatg ccctcctttc tctgcctctt 
40680 

cccaattctg ttttctaagt aagaccactg 
4 0740 

aaagtctgat ttacttgaat atacaactat 
40800 

agggaacata tctgatttat ccttatctct 
40860 

aatgcattga aatattcatg aagcttacta 
40920 

aagtgcctta aactttgttt ttctcttatg 
40980 

ataggattgc tgcaaggatt aagtgtgata 
41040 

ggctcacagg aatagtagct actaccataa 
• 41100 

ttatttagcc ctccttcacc ctgtgtccaq 
41160 

acaccaagtt ttctgtggga gcttccctag 
41220 

agctcatgct gttacactct cttcctccac 
41280 

gagaagggtg tgatctgaga tgggcttggg 
4134 0 

tgtataggaa aaagtgcctt cccaaagtct 
41400 

gggatttaga cccagagccc aagccaataa 
414 60 

ctgctatcac cacaacagct atcataccca 
41520 

ataattgaca aatgtcaaga catcctacat 
41580 

ttaaattggt agggaaggaa aacttttata 
41640 

gagctaaaat aaaggtatgc caagaacaaa 
41700 

gcctgatcca ttgattagtg aagaggaaac 
417 60 

tccaagtttt ttataatttt gcttcttcgg 
41820 

caaattaact gaaaaatgct aaaaatgcaa 
41880 



ggcgggaaca ggttggtcag caggtgaacg 
taaatcctta aggaaagtta ttaaatctct 
actaacatgc aaatgtttag ggtctgaagc 
tccagcccct tgtgctgttc tgggttgtct 
aacagctcag .taaacaattc attgagggaa 
aggaggaaat actgctggtg actatggaag 
ttggtcttat ttatcctggt gtcccaccct 
cgtagaagat gcgtgaaaat atcttttgaa 
cctccaagag tttactgttt agtttcacag 
gaagactgac ctatctttca aggccactgg 
agtcagtggt gacctctcct tctccctaac 
ctccctcttg gcctgtgaat ttcttgtgtt 
tccacagtac ctggtgtaaa atgcccaata 
aatgctctgc cttatgagcc atgaaatata 
taaaataagg ataataataa tgacacccct 
atatatataa aactcttagc acaaacacct 
tggtaacttc gagggcaagt tttctcagag 
gagtgcagat cagaatggtc agattccagg 
gaatataact aaggaattta aatcaggttc 
tcaggcattg ggtgtggctt ttccaagctt 
tatagagggg aattatattt aggtctaccc 
ccctggccta aagtataaga gatatgtgtt 
tgggaccccc ttctcacatg tggctacctc 
taactacaac agaggccaat taacgtggtg 
tgaggcacac tgtgcgtttt gcgtgagctt 
cctacaccta tcatggaagg cagaaggtaa 
ggcaggaaag aagggtttta acaacttgag 
atgttcaaaa accactctat aaccaccttc 
atatcttctc atcatagtct taaatgccat 
ccactctaag agaatgggtt agatgggaga 
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tggctttgtt aaagaagtcg gtcttaaagc 
41940 

aggaaggaca tttttggtca agagaagaaa 
42000 

gtaaaatatg gctagaacag ggtgcagagg 
42060 

aggctgaaag gtgtagacat tataggcaat 
42120 

ggtgacatga tagatctgta ttctagaaaa 
42180 

agggacagga taagggagat agataagaag 
42240 

ccaagtggta tggtggaaag gaatgagaga 
4 2300 

tcaatccatc ataacaggat gtataggatt 
42360 

gaggaacaag tttgtaaggt tttgggggcc 
42420 

ttggatatac caatgggaca ttcatgtgaa 
42480 

aggataggat cagaatgagg gacacagttt 
42540 

agataagggc atagatgagc ttaccaaagg 
42600 

aggctaagcc tgtgctattc tttctcctca 
42660 

cagtgagtgt catgataaca ctactgtggg 
42720 

ctcctctata aaatagaggg ttgaacaggg 
42780 

agcacacaac aaaccagcta ctatccaaag 
42840 

aaatccagca cagggaaaac actttctggt 
42900 

acccatcacc atcaccatca gtagctttca 
42960 

aacaattaga atggagagct aactctttgg 
43020 

cgtcaataaa gcgctaagat gcctgggcgg 
43080 

aagtctacct gctcagcata tggggcccag 
43140 

cacttcatct atggatceag tcttggtatc 
43200 

tttggttctc aatggtaggt gactggaata 
43260 

tcctttctgc ccccttatag tttaatttgc 
43320 

ccattatagg gaaatagaag gcagacatga 
43380 

tcacattgtg ctcaacaatt aggaggaatt 
43440 

gtcatagcat gaataaactc aatataggtt 
43500 

agaagtgttt ttggattttg gatttttttt 
43560 

cagcatccct aatccaaaac tgaaatctaa 
43620 

catattggca ctcaaaaggt tccaattttg 
43680 



aaaagtaggg ctttgtcatg gtagtatgga 
gtgcagggcc tgttgaggaa ggaatgagta 
ggaagaactt cagagaatga ccaaataaac 
aaagcaacca cagaggtttc taagccatag- 
gttagttttg cagcagttgt gtccattgaa 
acatgctatg atgataacta gatttggata 
acagggtcac agatgaatga ctgcccaatt 
gcccttaagt aagatgggga atccaaaaac 
aatgatgaat tccatttggg acatgttgct 
aatgatctcg gcaatcctat cctggaattc 
ataaggtaaa cagaatggag gtgatataga 
ggagagttta gaatgaaaag aaaagaccaa 
caatacgctt cagacctggg cacaaaccat 
caaatccccc ctctataagg gcctgatttc 
tggtccatat cctgttaatt gtgtttggag 
gggacatccc gaggcaggac taagcaaagg 
gctggtccca gttaggcagc gttcagttta 
gctgctactg accacactta taggaagaaa 
aaatggtcaa agaacacggg tctacaaaac 
ggtcaaaaag tctacctggg cggggtcaaa 
acatctgacc tttaccaact ccacaataac 
acctagtcgc tgttttcaag taacagaata 
cagcttactt tctcccaccc ctaccgccaa 
ttgtaaatta cttgggaata catttgggag 
tgaacagaat gcagggtgtt ttttattact 
ctagaagccc ctcccagtgg ccaggaattg 
gagtattcct tacccaaaat gcttgatacc 
ttgaatattt gcattatata cttaccagtt 
actgctccaa tgaacatttc ctttgagtgt 
gagcattttc aattttgggt tttgggatta 
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gggatactca accagtggta ggtttgggat 
43740 

agctgggaag ggtgggagga acatggaatt 
43800 

tactattagg gccccaaatt tgttctaagt 
43860 

gagagagaga gagagagaga gaattttctt 
43920 

aaaattttgg gtttctttgt attctccctg 
43980 

ggaattcatt aggtccttga tttcagaagc 
44040 

gggtggcaag ataagttgag ggacttttct 
44100 

ttgactgtgt atatctaaga atgggtgggg 
4 4160 

ctattattga ttgattgatt gattgattga 
4 4 220 

aatgtttcca atggctacag cagactggag 
4 4280 

ggctttccat ttcttctd^t tctggggcat 
44340 

agctggactc tgctg^r Adctctttag 
44400 

tgcaccttag aaatgat:?:j qratjaactgc 
44460 

agagaagcaa agtgattt.jt tcatgatcat 
44520 

gatccactgt cctttcctta aaccagrggt 
4 4 580 

agtgttggat ccatctcaga ctaattaaat 
4 4 640 

agtgggtttt ttgttctttt ttaaaaaaaa 
44700 

aatagaacta ccatatgatc cagcaatccc 
44760 

attattatat aaaaaagata cctgcactca 
4 4 820 

aaaaatatgg aatcaaccta actgtccatc 
44880 

atatacacac acaatggaat actattcatt 
44940 

gcaatatgga aggaactgga agccattctc 
45000 

aaattccaca tgttctcact tacaattggg 
4 50 60 

gtgtggaata atagacattg gagactcgga 
4 5120 

tgaaaaatta cttaatgagt acaacgtaca 
45180 

acactttacc actatgcaat atggccatgt 
45240 

atacaaataa aaataaataa ataaaagctc 
4 5300 

atgggtcccc agctttattt taactcaaaa 
45360 

aaaatattta ttgagtatct cctgtgtgca 
4 54 20 

gaaattagac tggggaaaaa gacaaggtca 
45480 



gatatcagca tgctaaggtc aaagagacct 
ttcattctct gggcacccct tgaacagtct 
gtgtgtgtgt gtgtgtgtgt gtgtgtgaga 
tcttccttta tattctaagt tcctcaggac 
cagctcctca tgtagttcta agcaaataaa 
ctcccagttc tctatgtagg aggaatctta 
tcaagcacat ttcacaagta agagaaaatg 
ctcaatgatg cccccctaag ttactcttta 
agaagcaatg ttttgattga ttgaagaagt 
caaaagaaca aaatgaaaga aaatacatta 
ctgatgaagc tttggatccc ccaaggtaag 
gaaaaacaaa agaatattgt cagaatctga 
tttattttct aaaaggtgaa atggagaccc 
acagctattc agtaaagcca ggacttctgt 
tctcaacctt gggagcttta aaaaactgct 
cagaacccat ggggatgagg cccagacatg- 
gctccctagg agatttctca aagaactgaa 
acttttgggt atctacccaa aggaagataa 
aatatttatt gcaacactat ccacagtagc 
catggatgac tggataaaga aaatgtgtat 
cgtaaaaaag aacaaagtct gtcttttgca 
ttaagtgaag caactcagaa acagaaaggc 
agctaaataa tgcatatgca tgggcacaga 
agggtggggg gaatgggaga gggtcaatga 
ttatttgggt gatgaataca ctaaaagccc 
aacaaaattg cccttacacc ccttaaattt 
cttagggctg agaactactg ctcctgtcct 
tgagtttaga aaaatttatg aacccattta 
aggcactgtg ttatgttaag tggctgaagg 
tggcctaggt ttcaaactaa tataaaagac 
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ataacaaata agaaaggatg 
45540 

tgcagatgtt gctaattcat 
45600 

accttttcaa cccacggctc 
45660 

ttgaacacta tttactggtt 
45720 

aacagagcaa taataagagt 
45780 

ct.tgcctgtc aggcattgat 
45840 

ttttgcacaa aagactatag 
45900 

aggtagaatg aggccagctt 
45960 

catgagaggc catagtggga 
4 6020 

gggctgattg agtaaagcag 
46080 

ggcgactgag tgagggaaga 
46140 

cactggattt agagcagaca 
4 6200 

acctgttctg aagtggcttc 
46260 

actgactgaa atggaaatcg 
46320 

taaaatgggg gtgtttatat 
46380 

acaagtgtgg cagagctttg 
46440 

tcttctgact catccacatg 
46500 

gccagacctg gaaatggaaa 
46560 

ggatggttaa gtcctcttaa 
46620 

tggataacac tactttgtaa 
46680 

ggctgggcac ggtggctcac 
46740 

ttacctgagg tcgggagttc 
46800 

gaaaatagaa aaattagccg 
4 6860 

gctgaggccg gagaattgct 
46920 

ccattgcact ccagcctggg 
46980 

agacaggaaa ataaaaaaag 
47040 

attatatttc tttatt-ccat 
47100 

atgagaaaag gagagagtga 
47160 

agagtattta taaatttttt 
47220 

tgcaatggcg tgatctcagc 
47280 



ccaccttctt ccaaccctca 
tttggcaccc tttttctctg 
taggcaagta tcaccttttg 
ttggaaagat tatacatgta 
taaagcaaga aagacaggcc 
tttgaacttc attgcatagg 
atgagattaa cctggttacc 
cataaatgaa tcatcaggat 
atggagggaa agggacaatg 
tggagaagac agtgaaagat 
ggtggtgatg atattactga 
ttatcaactt gtggtgtcca 
agcatctctg aggtcagatt 
agtaggcaag gcttttgatt 
ccatagtctt atcacagggc 
taaactgtaa tacactgtgt 
gatgtctgct gaccctgggg 
cttgaaaatg ttctctgtag 
atagatgtca gaaaaaatgg 
aatattttat cttatttcca 
gcctgtaatc ccagcacttt 
aagaccagcc tggccaacgt 
ggtgtggtga caggtgcctg 
tgaacccagg aggtggaggt 
cgacaagagt gaaactccat 
ccacctcaca tagtctacta 
gctctttgtt tttaatataa 
taagacttta ttttaaaagg 
ttttttgaga cagagtctcg 
tcactgcaac ctccacctcc 
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tccctcttcc ttttgacagt 
acccaaatat agtcttataa 
ctcttttggc accagatctc 
tgtctggagt tgaatgactg • 
tacaggagat ggcagagggt 
caatcaagaa ctattgaagt 
gtaaaggaca aagtgattgc 
atgagaagca agggcttgaa 
tgagaagcag tgaaggagaa 
gtcagatgac taccatgttt 
agagagaggc aaggggtggt 
gacatttcac cctgggagaa 
cctagttcta ctatttttct 
tgtctcagtg gtctcttctg 
tatttggggg attaagtaag- 
acaattggat aattatggat 
gaccggagcc tgggagggag 
aaaagataat taacatttga 
aggtcatgta gacagaatgt 
ttataaaaga aaaaaagctg 
gggagactga ggcgggtgga 
ggtgaaaccc tgtctctact 
taatcctagc tactcgggag 
tgcagtgagc caagattgca 
ctcagaaaaa aaaaaaaaat 
ccaccaaaca catcattaac 
acaattactt ttaagggaaa 
tggaataatt ctaaccatgg 
ctctgtcacc cagggtggag 
cgggttcaag caattctcct 
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gcctcagcct cctgagtagc tgggattaca 
47340 

tttttttttt tttttttgga gatggagtct 

47 4 00 

gcatgatctt ggctcactgc aagctccgcc 
47460 

cctcccaagt agctgggact acaggcgccc 
47520 

tagtagagac agggtttcat tatgttggcc 
47580 

atcctcctgc ctcagcctcc caaagtgctg 
4 7 640 

ccataaaata ttttttatag acaagtgaga 
4 7700 

aattttgaag gtcatctact ctgaacgttt 
47760 

tttgtttttg cttagtttac atttattaaa 
47820 

cgccatccat gcaatatatc acaagatatg 
47880 

ctacccatcg tacagaggag gaaactgagt 
47940 

cagagccagg aagtggcaga gcaggccagg 
48000 

agccctcccc attgtccttg aaccagtaaa 
•48060 

gacaaggacc ccatgggtgt gtctgagtca 
4 8120 

tcttctgtga aaaagagcaa aagaaaaaat 
48180 

ttgtgtgaat gtatggtgtg ggagtgggag 
48240 

gatgccacct tctccaatca cataccgccc 
48300 

ttcttttggc tctatcctgg cttcttcaca 
48360 

gacagggata gagctgtggc agtggagatg 
48420 

tcttaaatat ggagccactc cataatcatt 
48480 

cttgccagct gagctatgaa aaaaggatgt 

48 540 

agcccctgca ggaatcaaag gagctgacag 
4 8 600 

aaaaaaaaat gtaaaatttt tgaaaagaaa 
4 8 660 

ctgggctccc aggcagcctc ctgcagaaca 
48720 

aaccgagtaa ggggcttttt gttttcctta 
48780 

ccactggttc ttccccacaa tccacactcc 
48840 

tagttttgca tttatggatt tttttatgcc 
48900 

gaggctgaag tggtttaagg aggcaactgg 
48960 

agctgtggac gttagacttg acgagagcaa 
4 9020 

agtctcccag ttcaatagtg tccacaaaag 
49080 



ggcaaccgcc 


accatgccct 


gccaattttt 


tgctgtgtcg 


ccccaggctg 


gagtgcagtg 


tcctgggttc 


acgccattct 


cctgcctcag 


gccaccgcac 


ccagctaatt 


tttgtatttt 


aggctggtct 


tcaactcctg 


gcctcaagca 


gaattacagg 


tg.tgagccac 


cgtgccaggc 


gcagaaatca 


caggttctta 


tgagcaggaa 


ttttgtttgt 


ttgtttgttg 


ttgttgtttg 


tacccgttat 


ggtccaggcc 


cttggctaag 


cccagcaatc 


ctaggaggta 


gggtttatta 


catagagttt 


tagtgtcctg 


atcctggtca 


ccaagtctgt 


ctgacatcag 


agctcatcag 


gatggagttc 


ttctacaggg 


gtggttgggg 


gaaacatctg 


cgagtgggct 


gagaaatgag 


gggtcaggag 


ccaataatca 


ttgtccatct 


caataaacga 


ttctaaggtc 


acacagaaaa 


ctcgtccccc 


agttttctct 


gaaatagctc 


caggggtgtc 


cagtcatctc 


atcctggtgg 


aggaagctcg 


cctcctaagt 


gagtctgaat 


tggagtgaat 


attgggccat 


ggcccttttt 


cctaagacca 


gaggctgtgg 


gaccattccc 


aattgtttgt 


ttgttttttt 


cacaaattga 


gcctcattga 


aaagaaatcc 


ctctccccag 


tccttagcat 


tgcagagttg 


ttcccatggc 


gaagattgaa 


tcctttcaac 


cagaaggtaa 


aaacccccta 


cccttatttg 


actacatgac 


taattgaaaa 


aggctaaata 


tacagaaact 


cccagtggtt 


tctcagcaac 


cacatgtcaa 


gacatatcag 


aatctgtagc 


aggagcatct 


aaatccagag 


gtttttgaag 


caaggaattt 
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gggtggcact gctgtgagaa acaatcacct 
49140 

cttcttcaaa taccccttcc tttccaaggc 
49200 

tggacctgat ctcttcaagg aaggggaatc 
49260 

gaggggttat gagaccatag gctcattttg 
4 9320 

ctgatatggg gaaaaaaaga catttctgaa 
49380 

ccagcataag cacctttctt agagtacttg 
49440 

ccactatttt acatcaagtt aaaatagagg 
49500 

gaaaactgca ctaaaactcc ctccgtcatg 
4 9560 

ggggccagaa atcatatctt aaagatcact 
49620 

aaatgtttgt tgaatgaacg aactagtaaa 
4 9680 

ttaagattct ttagtccctc, attttacaga 
49740 

gcttggccaa aggcacacag caagctaaag 
49800 

accccctggc tgctacactt cctgcagcat 
49860 

ttttacaggc atgtactgct agagccgaaa 
4 9920 

aagctatgtg agcccctagg gaggacacag 
49980 

ctgcttatta gcagaatgta. agtggttgtg 
50040 

atgcacacct ttcccacctt caaactcaga 
50100 

actccaattt ccctttctgt ttttcttaac 
50160 

gtatattagt catggtattt cacgctagct 
50220 

actcagtgaa agtttatttc ttactcatat 
50280 

taagccacaa cttggggatg gggatgcagg 
50340 

gggatggcag agttgctctg gcataatcca 
50400 

cagttaacca cctagcctag cattgacaca 
50460 

cattcagtca tgtggcccaa cctagatgca 
50520 

gtcagcaaca actttgcact tggaagggga 
50580 

taactagcaa ttatacagaa cgttatttgt 
5064 0 

atcttcacaa caatcctatg aggttattgt 
50700 

ggtagagata taacttaact aatgcaaagt 
50760 

tcaggctgtt ctctccagag ctcaggctca 
50820 

tgatctgaga aaatgcatca gccactaagt 
50880 



ggctcctcca 


tggggcatag 


agtggagatg 


catgactcag 


aatgactggc 


gtagggagcc 


agatgagctg 


tttaatctct 


cttgtaaaat 


gggggggtct 


aaaatgcagt 


attttttgaa. 


ttgttgtcat 


gttgcagatt 


ctgggccgtt 


gctttgtgaa 


gtagtcctta 


tcccctcctt 


aagatgccta 


gaaatggccg 


tatagacaga 


cctgactcct 


ctctagacta 


tgaccatcga 


gtgcctccag 


tacccagcac 


ggtgtttaat 


attttcaaat 


cattagagct 


gaagtatcct 


taaggaagct 


aaggctcaag 


acattgtgtg 


gcagagggag 


gacaggaccc 


ggctgtctca 


ttctaattct 


tttaccattc 


ttgcgaggga 


taattagaag 


cctcttacta 


ctcatcagaa 


ctagcctaga 


ctctgcctct 


ttgccctctg 


tatgatgatt 


agtgtaagta 


ggatgggcaa 


agttgtaacc 


aagagtcaca 


ctgactaaac 


atatgtccta 


ttttaccaat 


aatagccatg 


gcagaaataa 


cttccaaatc 


tcattggctt 


aaagttgaat 


gtcctggtca 


ggcagttatc 


cagcttccat 


cgtattggct 


ccaccattca 


accaatagag 


gggggaggtt 


tggcacttgt 


caccacttct 


acatacactc 


ccctagtcat 


aaggcatctg 


ggaaatgtag 


cccctatctg 


gcctgaatcg 


ttattggtct 


ccaacacatg 


caggcaatgt 


gccaagaatt 


atttcattta 


cctctttaac 


gtatagatga 


aaaagt tgat 


tgcataagtg 


gttggtagca 


aatccaaaat 


tgattgctgc 


attctactgc 


tttgagcttc 


agcctgtgta 


gtctccagca 


attactttcc 
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tccctctgga tcttggtttc attctctgca 
50940 

atgtcacctg ccagctggga catcatatga 
51000 

gtccccagtg atgcttgacc atagcaaagc 
510 60 

aatggaagcc tcacagtgtc catgtaggag 
51120 

acaaagaaat caacatatag taaaaagaga 
51180 

agcggtagcc ttggaaatag gactttattt 
51240 

ggaaaagagt gcttctttgc cccaaatgct 
51300 

aaacacacca ttcccccaat ccccctcaaa 
51360 

gctgtaggac tttactgagc agtgtcctgt 
51420 

gctcgaggtg tatgaattcc ccagcattac 
51480 

gagctccaaa ctcatgctag acccaagtat 
51540 

atacagcaaa gctatatttc actgtgtcct 
51600 

gtgtacaact gcttgtgttc tttctaccta 
51660 

ccaaactcca tttcaggttc ctcttgagat 
51720 

aggcctcttg gaatgtcttt tgaggcttta 
51780 

aacagaagaa gagcatcacc aaaaggattt 
51840 

gccactagca aaggcattct ttcttgggaa 
51900 

cacagaaaca agggagaata aatagagact 
51960 

tgaaaaggct gagctctagg cagagaagat 
52020 

tatttgggta ggttcctctg gtgtaaaggg 
52080 

aaagaacgca aggtgtcaga gggtggatgg 
5214 0 

tctgcccttt gtagcctggc ccagagagcg 
52200 

tggcattttt cccatggttc tgtcatctta 
52260 

tcttgttggt ttggagaagc ttctttttaa 
52320 

ttttaatcag gtgtggtaag atgcacagac 
52380 

tttatactca cagatccctg taaataggaa 

524 40 

caccagggtc agccgcaagg cagaaggagc 
52500 

gtggattcag tggcaaagaa tgggaggggc 

525 60 

gaatgacttg attgagctgt agggtgtaga 
52620 

attagggcag ctggatagtg gtctggagtg 
52680 



aagtgaggat 


gtttaactgg 


ataaaatctg 


ttctcagggt 


aagcatatca 


ggtgggtggg 


cctttcaaag 


gtttcttagc 


acaccacata 


aaagcagggc 


aaagtatttc 


catttaccca 


gtgttttccc 


accaaggcct 


cagattgact 


tgtatagtac 


ttttgccacc 


agggtggggg 


ggtttcataa 


aacctaaaga 


tgtcacatgg 


aaactacttg 


cacttaaatg 


aaagagtaaa 


ggggtccttg 


cactgccatg 


ctcttgaggg 


ttctccttag 


aggtttcaga 


tgagcagtat 


ttcatgaaag 


aacaatcctt 


gaatgacttt 


agaaaaccaa 


t'tgtgtgtgt 


ttgtgtgtgt 


tgtccccctg 


atgcctccac 


acagaacatc 


tcccaaactt 


ggaaacagga 


gatgcttcaa 


tattgtgata 


tgttggacag 


atggttaaga 


ctcattttat 


gtggagatct 


attaatattt 


tgaattatgc 


ccctagaatc 


agattgaccc 


tgagcttaga 


ccttacaaca 


tggccagagc 


gcaagagcag 


cttcagaaga 


cctgagagct 


ttcttgttca 


cgttttcttc 


cagaataaga 


aaacagggta 


taaagcagga 


gcatttggaa 


tcaggcagct 


tgttgggtaa 


taagtaacac 


aagagcagga 


tacataaagg 


gattcagatg 


taccttgttt 


taaaatttac 


ctggaattta 


atggagatga 


cagtcatgaa 


ggaagaagta 


gcatggcctc 


catgcaggcc 


aatggggaag 


aagaggaaaa 


catggacaag 


aggctctact 


agagtaagca 


ggtttaggat 


tatcgggttt 


gactgcctct 


actgtctggc 


accaggggta 


tgagagctcc 


ctaaaggagg 


tggttggagg 
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tgtaggtttt ggattggttg atctgtatat gaaaggtgca cgtgcaggtt gagtcctcta 
52740 

ctatcactag aaattggctg gtcccaggag aagtagtctc tctagagaca gcaatgcccc 
52800 

agatgtcaaa gcatcagaaa atacagaaaa aaaattaaaa gcatgattaa ttcatactca 
52860 

caggtctagt ttttgtgtag ttaagagcaa cctaaagaag ttgataactc gtgttgcagg 
5292Q . . . - - 

tcaggtttcc cagaaatcat attctcagat gaagatttgc atgaaggagg tttaatgctc 
52980 

aaactaagcc ctaaggctcc atacctgtgg aggaagtgaa agaagcccaa ctgggcacag 
53040 

aaggtggaac acaatgccag tcacacaaag acctcagtgg atcctgggcc atgaggagct 
53100 

ctaagcacag atgacccttc agaaatgtct ccaagtgggg aaaggaa'tca tgctagtcac 
53160 

tggatgtggg cttcccactc caccccatga gggcatgacc ttaagtgaga gagctctttg 
53220 

gacacagggc atcctaagag gggcactcag cagccacatt gggcaccaag actctcagca 
53280 

gctagaagaa gaaggtatag tcccaaaggg gaatctgggc tgcacacctt agtatccatt 
53340 

agaactggaa gtaggctgaa tcccaggcag ggatcccctg gagaacacag gtaatttttt 
53400 

aaaaaatcaa gctatgtgtc tgaggctatg tggtaagaca tctcagtttt ctgctaggaa 
53460 

aagccaccaa accagattgg cttattcatg ttgaaaagtc tgagaatcac actcagatgt 
53520 

tgttgataat tctgcttgga taaaatttat ctattggtat gcttgtgata tagcagtacc 
53580 

attgctaaaa attccatgcg gagaatccaa tctgcatcat tttctttctc aatgatttgt 
53640 

ttttaaaggc agaggttcgg ctgtgcccct ttaaaccttc tgtgcaagtg ccagcttcct 
53700 

ttcaaatgga gaagcagcag ccctgtcaga aagggtggct ggagctcccc ttttgtgaga 
53760 

ggaggaaaac ttactgggaa ttacctgttc gagagccaca catgaaggca taccactgct 
53820 

tcctctgacc ttccagccgg tatattaatg acatactgtt • gtacctgaga accaatgatg 
53880 

aagtgggtga tgtgcctggc accttaaagg cctgggcctg ctttgacagg ggagatgata 
53940 

cacaacatgg ctgttagcca gctctcactg catctggaag caccatgttc cttagagcca 
54000 

aagttctcaa actgtgcttc ctgctgggct ccacagatcc ttcccgttcc accctgcaca 
54 060 

caaacgtgca cacacataca cacacacaca cacacacaca cacacacagt gttctcaatg 
54120 

ctcgccattt agttagtatg caccaaatat gtgtagtatc tggttccacc cctggcctct 
54180 

cagacaatta ttagtatttt tgggagcggg gaggagagtc aggaagaccc aagcgccata 
54240 

tttattattt ccccagccac cccggcccag gctacatcca agttcaaagt ctatgacccc 
54 300 

ctctctgagc tttcagcact acctcccttt gtgggggagg ggggtgccaa ttctctttct 
54360 

tctcatcatc tcctgttgca aaataaaagc ctaggcattc ctttgagaaa cttgggcctg 
54420 

gcattggaag gcgtctgaca aaggctttgt taaatgagtg gagggaggga cggtctggga 
54480 
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gatacttttt caggtggcat aggacctccq 
54 54 0 

ttttct-agaa atctacaggt gtttaagctg 
54 600 

cctttcattt tgcagatctg aggcctagaa 
54 660 

gacagaattg ctcagtgaaa. agtccagcat 
54720 

tgtgctcagc tagtgaggca cacttacttc 
54780 

cttaggacag cctccacctg ctcctactgt 
54840 

gctgaggagt aataagagca cctgatatgt 

54 900 

ctaagt-actt cctatgaatt ttccatttat 
54960 

attattccag aagggaaaac cgaggcaatq 
55020 

ccattacact attttgcaca agtagccagt 
55080 

tgcaaaaggt aatttcttta gcactttatc 
55140 

agggaagaga agtggggaag aaaagatgac 
55200 

tcgagtgcca agtgcgctgc gctggggtct 
55260 

gcttcctgga attctctcct ctcgtgqggc 
55320 

aaagatctac agacaaattg atctttaagt 
55380 

aggattctat atccttccaa ttatctctgt 
554 4 0 

agggagccgg acactcgtct gtcagcccca 
55500 

ccctgcctgg tcacgggaga ggaagagaca 
55560 

cacgcgctct ctcccaccca cacacccgct 

55 620 

tcctcagagc ctctgcagac tcccagcagg 
55680 " 

atgctgtaaa ctcatttggg tcttgcttgg 
5574 0 

taattgcaga gtcgatgatg atctctaggc 
55800 

attgccagag gagggcaggg atgggggcag 
55860 ' ~ 

aaccaacctc caacaatttc agccattgca 
55920 

tggacaatgc caattaaatc agagcaggtg 
55980 ~ 
ctacagtttt attttgaata ctgttgcagg 
5604 0 

atctcagtct attcccagct cagccaaaag 
56100 

ccctatccat ttccttacct gcaaatgaga 
56160 

tcaactctaa aatcctagat gatcctcaga 
56220 

gatgtcaggt ggtccgcaga gggtgaggct 
56280 



cttcttccct tctcacatga gaaggaagat 
gaatgtgcct cagacatcat ctggttggac 
agatttggta acttgcccca ggtcacagtt 
aaatacccca gcccatgtgg ccactggctg 
ttaatttgtg ccacccactt ttcaggctcc 
gcttcccatc gtccctctcc tcaggcacag 
gtcaggcctt actgtgtgct aggaattgtg 
tctttataat aactttgtaa agttagagcc 
ggagtcaaag caaagaattt gggcttttaa 
aatgaaaagg ctgctatccg gaatcatctt 
agaagaaggg ggctccttcc tcaaattctg 
tgaatccaaa gctcgggcag ggaaagcaca 
agtcctgact cagccgccat cttcccaagt 
ctcagctcct tcatcttagg aaagaagggt 
atccttagag cactaccatt ttcagaatct 
gtagggaatt attggtcgtg tctcctgatt 
cctggctctg caaagtccct tgtgtatctg 
aggaaacacc accgctccga ctctgtggag 
caggagagga ggaacctgca catttgagtc 
ggtctggctt tcctctcagg tagcacagtc 
tatgataatg cgtttagttg aagggttata 
caatttaaag . tcaaagctat ttttaatgga 
ggaggagaga tggttagaga gtgcttttga 
tttccgaacc tgaattttca gggcagaaat 
tatgtgagag ctgggttcac cttcttgcag 
tagtgaaaat atgactaggc tgaataagag 
cccttagtgt gtccttgatc aagttacttc 
agcttgaacc aaactatcct aatgtccctt 
tgtcaacagt gctgaagccc agcactgtaa 
cttcctgctc aaattatttc ttccacccaa 
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gactcctcag ttacctctgt acacaacctt 
56340 

ggctttagtt tacaaatttt cttggggaag 
56400 

tgaatgccat gtagggaggg catggtttga 
564 60 

ttattctcaa atcttcagag aagaaaattc 
56520 

aaatgtcccc taggctactg tcaggttagg 
56580 

tataattgaa caagataagc atgtgaccta 
56640 

cagcacagaa gtggctctct gaaccaattt 
56700 

gatttagcct gtggttatag tgttaacaat 
56760 

ttccctttag caggatcatt ttctctgtgt 
56820 

aaataatagc atcttctgga atacaaatga 
56880 

agtaggatga tttctcagct ctaatacact 
56940 

tactggtcag gccagttcta gatacaatca 
57000 

tggattgcat atgtgacggt agtcctttaa 
57060 

ttctaggtct agatatgttt agatacacac 
57120 

gtttccagta cagtaacctg ttgtacaggt 
57180 

acagcctagg tgtgtagtag gctataccac 
5724 0 

tttcacagtg atgaaacttc ctaatgacaa 
57300 

gaggcatgac agtactatat ctcaagactg 
57360 

agaccaatgt atttagttga atcgtggacc 
57420 

aatacaagtt caagtccacc cagaacctca 
57480 

tgcaaatgta gtaagttaag atgaggtcat 
57540 

tgactagcat ccttgtaaga aaaggaaaag 
57600 

aacagagaca aagactggag tgaggcatct 
57 660 

agccaccaga agctaggaag aagcaaggaa 
57720 

atggccctgc tgacaccatt gtttcaaatg 
57780 

gtatattgtt tcaaaccatc cagttggtgg 
57840 

gatggagagg tgtctgggaa gcccatgaga 
57900 

ttggctcgtc tacagcccat tttcttcatt 
57960 

ctgtgaaacc tgtgaattgt ctccaccgca 
58020 

cagggttgtg tggcaaggct ttcatgcctc 
58080 



gcaggcccat ctaagtatcc aataacctgg 
aaggtaaaag ggatctagct ttctgggtta 
gttagtcctg gtgctgggag ttcatgagac 
cgtgaacacc tgggaacatc aggaaaaaaa. 
ctgctggttc tgatttgacc ttgaacttgc 
atgaaatact ttaaaacttg tagcttcctt 
taagcaatcc tggctctatc tgtgcatgtt 
ttagtgattc acctcatttt taatctctct 
taagggatca acattgaggt aagaatggct 
ctttataaat aaaagaagat aaaaggaaga 
tagcaaatgc catatgcttt ctcctgcgtg 
tgcgctgcat aatgatgttt tggtcaacag 
gattataata ccatattttt gctgtgcctt 
atacttacca ttgtgttcca attgcctaca 
ttgtaaccta ggagcaatag gctataccat' 
ttaggtctgg gtaagtacac tctatgatgt 
atttctcaga atgtatccca gttgttaagt 
tccccaagct gaagtctcca gtggacacaa 
ccaaaagttc aagtccaccc agaacctcag 
gaatacaatt ttatttagaa atagggtctt 
accagagtaa agtgggccct aaatccaata 
gaacacagac aggggagaag gccatgtgag 
acaagacagg gaacaccaag gattgccagg 
gcatcctctt ctggggcctt cagagacagg 
tttagccttc agaactgtga gacaataaat 
tactttgtta taggaaacta atacattcag 
acaaatggaa agagccagaa gccctcaacc 
cccgcatcca ggctttgaga -tgacaggaag 
aatcctgctc cctggtccca cctagactgt 
tcactgactg cctagtacgt cccctcaatg 
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actggtccac atctttctca cctttctcat gcatggcccc agatccaccc cagtgcctcg 
5814 0 

tcctcaagag gtgatttatt ccgagacact gatgagagca ctgtccttcc tgtgtctgag 
58200 

ggaaggcatg taactcttgc ttatcttcac ctgtgctcta gatcctgacc ttctctggca 
582 60 

acctcaggga ccttgcacca tccattcttc tcgcctaatg gcgagactca gtctctccct- 
58320 

ctccctttcc actctccctt gccattctta gtatctttct acaagcaggt cttccaaagt 
58380 

actgcttgag gtctgagttg gagggaacat gcctctaccc tactaaaaag agaaattcct 
58440 

ctgcagaaga cccaagctga ctgacaaatc cctttactgc aactgcagct ctagctccca 
58500 

ccattttcct gtacttactc tcctgctcag gttccctggc attgctgatg tctttcagcc 
58560 

tttgtgccct ggcccctttc ctcctctccc ctcatctagc actacctgtc aaaatcaggg 
58 620 

acttacttta aaatttatcc caaattatca ttgccatcat ctccactgtc accttatcat 
58680 

atgtttgaat agcgtttcca tttcccaaat gttttcgcat gcactttctc aattgagcct 
58740 

tacgaatcct agagctgaga agggtaacaa tttatgagtc ctttgacaaa tgtggaaact 
58800 

gacatcacag aaagtaagtt gccagccgat atgtcactgt cttcaaactc ttctttgtat 
58860 

ttttattatc tcccattata ttctgcctct tgtaatgatt atttctacat tggtcatatc 
58920 

tttccttctg tactgatctt cgcttatgat aacaaataat aatagtttac ctttgcatca 
58980 

cacttgatgg tttacaaaat gcttcaaatt caacatggcc cttgatcctg aagatattta 
59040 

tcacttaaga atcattatcg ccattttaaa atacaaattt attacttggg ctaaattttc 
59100 

ttattatagt tgggataggc cttcatccat agggtgagtg cagtatttgt ggactgtcat 
59160 

ggcagcttaa acatttagta cttgaaaatc tgatgcattg atcatcagag aaatgcaaat 
59220 

caaaactaca atgagatatt atttcacccc agttaaaatg gcttttagcc aaaagacagg 
59280 

caataatgaa tgctgacgag ggtgtgaaga aaacggagct ttcatacact gttggtgagg 
59340 

atgtaaatta gtacaaccac cagggaaaac agtttggagg ttcctcaaaa aactaaaaat 
59400 

tgagctaccg tgtgatccac caatcccact gctgggtatg tacccaaaag agaggaaatc 
594 60 

agtatatgaa agaggtatct gcagccgggc gcggtggctc acgcctgtaa tcccagcact 
59520 

ttgggaggcc gaggcaggca gatcatgagg tcaggagatc gagaccatct tggctaacac 
59580 

ggtaaaaccc cgtctctact aaaaatacaa aaaattagcc aggcgcggtg gcgggcacct 
5964 0 

gtatttccag ctactcggaa ggctgaggca ggagaatggc atgaacctgg gaggcgtaac 
59700 

tttcagtgag ccgagatagc accactgcag tctggcctgg gcgaaagagc gagactctgt 
597 60 

ctcaaaaaaa aaaaaaaaaa aaagaaagag gtatctgcac tctcatgttt gcagcagcac 
59820 

tgttcacaat agctaagatt tggaagcaac ctaagtgccc atcaacagat gaatggataa 
59880 
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agaaaatgtg gtacatatat acaatggagt actattcaat aaaaaaaaag aatgagatcc 
59940 

agtcattagc aacaacatgg atggaactgg agatcattgt gttaagtgaa ataagccagg 
60000 

cacagaaaga aaaacatctt atgttcttac ttatttgtgg gatctaaaaa gcaaaacagt 
60060 

tgaacctatg gacatagaga gtagaaggat ggttaccaga ggctgggaag ggtggtgggg 
60120 

ggcttagggg gagggtggga tggttaactg gtacaaaaac agaaagaatg aataaggcct 
60180 

actatttgat agcacatcag ggtgactata gtaaataata acgtagctgt acatttttaa 
60240 . 

aaaacttgag tataactaaa ttgtttgcaa ctcaatggac aaatgcttga ggggatgaat 
60300 

atgccattat tcatgatgtg cttatttcac attgcatgcc tctgtcaaaa catcatatgt 
60360 

acccaataaa tatatacaac tactacatac ccacaaaaat taaaagtaaa aaaaaaaatt 
60420 

aagaaaataa aagaacaaaa gtagatgtat tctacatgtc tccatattgt aaaactagaa 
604 80 

ccagtcagtt aactttagag gaaggggatt gtggacttga tataaagaca actttataat 
60540 

atgcagagca gcctaatcct acaattgtca aaaagtatag tggattcttt atttatttgt 
60600 

ccatgatatt atagaggtca tttctgcttt aacaagtagg tgggagatag ctaggtagga 
60660 

tatattttgt tcttattttt tattttaaaa tattgggctg tggctggaca tggtggctga 
60720 

aacctgtaat ctca^cactt tgggaggctg aggcaggcag atcacctcag gttaggactt 
60780 

ttcgagacca gcttggccaa tatggtgaaa ccccatccct accaaaaata caaaaattag 
608 4 0 

ccagttgtgg tggcatgcac tgtagtctca gctccttggg aggctgaggc aggagaattg 
60900 

cttgaacata ggaggtggag gttgcagtga actgagatta cgccactgca ctccagactg 
60960 

ggaaacagag tgagactctg ttttatatat atatatatat acacacacgt acatatacat* 
61020 

gtatatatat acacattatt attgaaagca gccaaagaaa aataacacat tatatataga 
61080 

gaaagagcaa atgatgagtg actttatatg tatatatatg t'gtgtgtgta tatatataat 
61140 

gtgtatatat atacatatat atatataggt taagaacctt cagcacatgt atacctatgt 
61200 

aacaaacctg catgttcagc acatgtatcc cagaacttaa agtgaaaaaa aaaaaaaaga 
61260 

accttctgca tgccagtaac tgtgctaagt* gattaggatg caatggtaat aaaaacaaag 
61320 

tccctctcct taaagaattt tctatttaga agggaaaact ggtaaataaa aaataaatat 
61380 

ataaattaca atttgtgaaa agtgctacac atgaaagagt gctgagacag acatcaatgg 
61440 

ataaacttta gattgagaag ggctctgaca aagcaacatt taaggtgcaa cctgagagaa 
61500 

tagaagttaa acaggcagat attggtgaaa gagcagtcta ggcagaggga acatcatttg 
61560 

caaaggccca gggtaaagaa gatcctggta aggaaatgac agtggaagaa ggttagtgta 
61620 

gcaggactgt ggctagggcg gagaggcagg gaagtagttt agaatttcaa tgcaatagga 
61680 
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aatatggaag attgaaggca 
61740 

actttatcta aggatggaag 
61800 

tgagcagcag ctgctctaat 
61860 

tctgattcat aaaggggcat 
61920 

cctgagtcta aggggagacg 
61980 

taagaatgaa aagaggaagg 
6204 0 

tttttttttt tttttgagac 
62100 

aatctcagct cagtgcaacc 
62160 

ccaagtagct gggactacag 
62220 

aatgagtctg tcttcactgt 
62280 

gaaataactc tggctgctca 
62340 

ccaggctgta ccaggcactc 
62400 

ggagagccta cactgaagtg 
624 60 

gtaaagacgt ccaagagaag 
62520 

atcagagaga aaggaatttt 
62580 

aaaatagaag ggaaggacaa 
62640 

ttcactctct ctctaatcct 
62700 

tcctgctgga gtcatttctg 
62760 

ttgcaggctt ccctgtcttt 
62820 

tgtgtggccc tttgagctgc 
62880 

attaatctac tgtagcatct 
62940 

ggctttgtgc acagggcaag 
63000 

gcccttgtcg ataggatctg 
63060 

caacatttat tacaatctga 
63120 

tatcagccac agtagatgta 
63180 

attttttttt tctagttcat 
63240 

atgtcctttc tcagtcacca 
63300 

aatcagtcta aatcatattc 
63360 

/tctaactttc cagcctcaac 
63420 

tgtgggatcc tgaacttgta 
63480 



gttttgcatt ataaaataat 
attcttaaat aaacttgtgt 
tcagagcagt cctcctgcca 
ttttctctgg gagaaaacca 
aagcagcaaa agaaattggc 
gaggtgctga gccctttttc 
ggagtcttgc tttgtcgccc 
tccggctccc gggttaaagc 
gccctttttc ttaatccaca 
ttccattcag tggctggaga 
cccagttgtt tgtaaatttt 
agatatgaca gtgaatgaga 
gacatgaggg agttgaaagc 
aaagatgaag ggcaaacaca 
cagacctgac ctgaatgatg 
gggaaattat ctgggcagca 
tttccacccc agccccaaat 
gtattaaaaa tgtacagaaa 
cattcctatt gtagaaagca 
tgtgattgtg ttaggacccc 
ctacaaatca agaggctggc 
ctaaatgtct ccctacagtg 
atatatagat gcatgtctac 
atgtgaaatg gctattctgt 
taaggagcct ggtttcactg 
ttttagtctg tggagcaaac 
gggtgtggtt atttggtttt 
tgttgaaatc agaaccaaag 
tcctacctgt tctttgttac 
acctgtgctc agactggtgc 
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atgattgcta ttttaaagct 
atacttggac cacaccacca 
aacgctgtgt gagacaaagc 
gtgatccatc tgtagaagta 
ttgtgaggac agggacattg 
ttttttcttt ttcatttttc 
aggctggagt gcagtggcgt 
gattctcctg cctcagcctc 
accttcagtt ggattttgca 
caacttggaa gagaatctca 
tattgagact ctactgtgtg 
taggcaacat ctttgccatt 
aactcttata ggaaatcatg 
tgcacggatg ccaaacatct 
aaaggaggtt tttggaaagg 
atatttatct gctgtggtgc 
ttgaaaggat tgcagggagc 
ggaaagcttt ggttctgagt 
gcttatataa aaagatgtgc 
actggatggt attcgcatga 
ttctgtttga aatgtcccaa 
agactgaaaa tgccttgggt 
aattgcacag tggctgctgg 
tcaaggattc tgataaaaag 
caactgacta cagttatctg 
agagatttcc tccccaaatg 
atgtagagga gatagaaacc 
gatccacaat ctggctccaa 
tcttacccct ctaaaccact 
ttttgcactt ctctgatggg 
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aaagatttct ctcatctttt atgattcagc 
63540 

tgctcctgct ggagagcttg tttcttctgg 
63600 

ttgggataca aagccaagta acatagcatc 
63660 

aatatagaca agtaaataca ccctgtgcaa 
63720 

aaataaagaa tttaatcaaa tcaaattgaa 
63780 

tgatgaggtt cccagtttac tctagtgagg 
63840 

acataaggct ctatcctatt attaaataaa 
63900 

caacctgaca ccatcatcat cataaaataa 
63960 

tctaaaccaa Cdttgtatqa gataattact 
64020 

gcctctaaat actaacjittt cagttacaat 
64080 

tttggtccat ttt :cdj<7. : i;> ^.ir t Mcttt 
64140 

attagcacaa aggj^t t .jrj oj^tda^ttt 
64200 

tgcgtctttt ttct t :t t * t a-r a-.irtgg 
64260 

gtaagaatgg acattctr^: rtta'ttcca 
64320 

tcatagatgc cttttatcao *cr.gattaat 
64380 

agtttattat ttcatootou ttttttaaac 
64440 

tatgtatctg ttgaaatg.it catatcgttt 
64500 

cacatttatt gatttttcaa atgtgaagca 
64 5 60 

gtcatcgatg tgttatcctt tttgtatgct 
64620 

gtatttgtgg tgtcttttca tgagagatgt 
64 680 

tttgtaaggg tttaagaaag caaggtcagg 
64740 

ttcaccttta tcttctgaaa gaatttatgc 
64800 

caacagaata taccagagaa actatttgga 
64 8 60 

taaataatgc aatttcttta atacatagta 
64920 

tctgatgaat tgtgtttttc aagaagtttg 
64980 

caacaaagtc tttgataata ttcctatatt 
65040 

atgttccctc tctcactttt ttaaagaagt 
65100 

tttattttat tttatttttt cttatttgag 
65160 

gtgcagtggc tcgatctcgg ctcactgcaa 
65220 

tgcctcagcc tcccgagcag ctgggactac 
65280 



tgaagtttca 


atgcttctga 


aattttttcc 


attcccatag 


gtcaggtcct 


qtqtttqqca 


catattctca 


aatcctcaca 


atttqgtaqq 

3 3 3 3 


ccttttgtaa 


cagaggtata 


aaagggtatg 


tatgggcttc 


aactctgaga 


tcttcttcca 


tcatgattcc 


atactggcac 


tcttctaggc 


gattattacc 


attctcactg 


caagcagcag 


gtaaaacnag 


agttaattaa 


gtgtgaactt 


cataaaaatg 


attcttcact 


ttccaaaggt 


aaaacttaga 


tccaatttac 


agatattaaa 


tctcataaaa 


taaaaaaagt 


atgtgagaat 


tatttatcca 


qatqtqaqat 


aagaggcaca 


ttaggacctc 


tagtatgttg 


aataaaagtg 


gtttgcttta 


atatgttttc 


tgtcagtttt 


tcagtctatt 


attatttcag 


tatgttattc 


catgaatgag 


tttgaatttt 


gtcattcctt 


tgctttctaa 


agct tctaat 


atggtttaat 


aatttaaatt 


catggcataa 


at cctacttg 


tctgggttca 


atctgatact 


attt tgttaa 


tggtctgcaa 


tttttttttc 


tt gt aaggtt 


taagcttcac 


aaagtaagtc 


aaoaaatatt 


aacgttgaaa 


ttatttgttt 


caaaaataat 


cttagagctt 


ccttqqqqaa 


aggtttttga 


cttatatttt 


ctatcttacc 


ttgtgacaat 


cccatgtcat 


ctgagttgtt 


aaacttacta 


agcctttgaa 


tgtctataag 


atctgtcctg 


cttgctagag 


gtttaccaat 


tttattttgt 


acagagtctc 


gctttgtcgc 


ccaggttgga 


gctctgcctc 


ccaggttcac 


gccattctcc 


aggcaccagc 


caccatgccc 


ggctaatttt 
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ttgtattttt agtagagacg gggtttcacc 
6534 0 

ccttgtgatc cacctgcctc ggcctcccaa 
65400 

gcctggccga ggtttaccaa gtttattaat 
65460 

aatttttcct attttttatc tacattatac 
65520 

tccttctctt cactttgggt ttaatttgtt 
65580 

gctgagatca ttgattttga acctttcttc 
65640 

atttcccttt aagcactgcc ttagctgtat 
65700 

tcttttattc aatatattct aatttttctt 
65760 

agaaatatgt agttagtttc caaatattcg 
65820 

atttctaatt taattctgct acaatccaag 
65880 

aatgtattaa gaatattacc agagataaga 
6594 0 

atatttcata aataatagac gaattgattc 
66000 

atgtgtctaa taacagagtc tcaaattata 
66060 

aatggccaat cccacaatct ttatctttat 
66120 

tgctcttgaa aagaaagtgt attctgtagt 
66180 

gaggtgattg ataaaattat ttagattgtc 
6624 0 

gaattactat gacaaggatg ttaacaacct 
66300 

ctctttagtt ctgtcagttt tgttccatgt 
66360 

ttcatgattg ttgtatcttc ctgatgaatt 
66420 

tttatctctg gtcatattct ttatcttgaa 
66480 

tcatcctttt tatgcttacc atttgcatag 
66540 

ctatttatcc ctttatactt aagtccatgt 
66600 

tgattatttt tactcctttc tgacaatttc 
66660 

agttggagtt aaatctaccg tcttgttatt 
66720 

ctcatttcct tgtttattac cttcttttca 
66780 

cctcaattgg ctttatagct atatatcttt 
66840 

agcaatatgt atacttacca cagacaattt 
66900 

tagaagaagc ttgcagcagt ctatgtccct 
66960 

cgtatgtatt acatcacgta cattgtaaaa 
67020 

aatccttgtg tgaattaaaa attttatgag 
67080 



acgttagcca ggatggtctc gatctcctga 
agtgctggga ttacaggcgt gagccaccgc 
cttttcaaag gactacattt tggctttgat 
tgattccaat tcttatcttt attcttttct 
catttttttt tctggcttct tgagatagaa 
ttttctaaat aagtgcattt aaacttacac 
ctcacaaatt ttgatattgt cttttcattg 
gtgatttctt ctttggccca taggctgttt 
aagactttca cagatacctt actattattg 
tatatacatt ataaagtttc agccttttga 
agataagaat attaccagcg ataagtaggg 
atcaagaata tacaacaatc ataaatgtgt 
tgaaacaaaa ctgacagaac taaagagaga 
caggtgattt atcttggtga acattccttg 
cattgggtat aaaattctat atatgacaat 
tatatcctaa gttttgtaga attatttcat 
acagctatga ttgtggaatt ggctatttct 
aatttgaaac tctgttatta aacacataca 
ggttccgtta ttatttatgc aatgtcccta 
gtctttttaa' ctgatatgaa tgtagccact 
tttatatttt tccattatct tatattcaca 
cttgtagaca gtatgcagtt ~aattgtgtct 
tgcctttcca tataatatgc ttatcaatac 
tgtcacatct cccatctttt gttgttgttc 
gttatttttt ttttgtattc cattttaatt 
gtattatttt ttattgtttg ctctagggat 
agaaatcata ttgtaccact tcacataaaa 
ttacactccc attctttgtg ctattgtttc 
tccacaatag agtgttataa tctttttcca 
tagaaaaata catataacat tttattctta 
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cctacatact taccagttct gctttctttt 
67140 

acccgttttc atttggtgtc atttccatta 
67200 

ttctctattt ccatgtatct taaaatatct 
67260 

taattggaca tagaaatcta ggttggcagt 
67320 

tctgatttct gttgtttatg aggaaaagtc 
67380 

tgtattattt ttctttggct gtttcaagat 
67 4 40 

tttgattatg atggcctaag tggtattatt 
67500 

ttgagcttct aacttctgtg agcttttttt 
67560 

attattatat aatttttctt ctccattctt 
67620 

aggttagact gcatgacgtt gtcccataga 
67680 

tttctctatg ttcttcagat tggacaattt 
67740 

tttattatgc caccttcaat ctgatattaa 
67800 

tattatttat agtattaatt tctctgctaa 
67860 

caatattagg ttcttaaaca tattttaata 

67 920 

atctgagtca tcttggggtt attttctatt 
67980 

tttttttctg tttcttcaca tgtctagtaa 
68040 

aatattataa acagtatcaa ttacattatc 
68100 

gaagtagtta aattactagt agaacttttt 
68160 

tacagtgagc ctattttagt tttaaagtta 
68220 

tgctccttac ttctatcaca tttatttcta 
68280 

gagcaggaac tgcaacattt cttgacatgg 
68340 

aggcctgtaa gaagagatct ctgttgggtc 
68400 

cagccttcag ccaaagactt gcaggaaaac 

68 4 60 

ttgatgttta ggaagctaaa cgtcaagtat 
68520 

aatttcactc accttaacaa ctcagaactc 
68580 

tgtcactttc tgtggg.ctcc atcttcctgc 
68 64 0 

ggctggacaa ttgtgtagta attgcttcac 
68700 

tgcactgttt gctgttcaat acctgaaaat 
68760 

gattgttttc aatgggagat catttctagt 
68820 

agttcaactt ggatcatttt aaaaatacaa 
68880 



cattcttacc tgtttcagtc ttatctgtaa 
gcatttcagt gcagaacttc tagcaacata 
ttattttgcc ttcgtttttg aaatatattt 
tttctcttat actcttgggt ttcattgtct- 
attgattatt tgctctttct ctatacacaa 
atttttctct ttatctgtgg ttatcaacac 
gttgtttgta tttattccac ttggtgttcc 
ttctcagcga atttggaaaa atttaagcca 
tctactctct ttggaactcc agttgtacat 
tcactaagac tctgttcatt tttcaatttt 
atcttgatct ctattaatgt tcacttatcc 
ggccattcag atctagaatt tctattaggt 
gattttttgt ctgttcattc attatgacca 
gctgctttca agtccttgtc agttaattcc 
gagtgatctt taccttatct gtcggtcaca 
ttatttattg tttgctgtat attgaaatga 
ttccttttaa gggtattgag ttttgttctg 
gttcctgtca aacttgatct tattctttgt 
gtcctagggt acaactcttg ctctattgta 
ttgcctgaga tagtcaatga gttctcacct 
tcttacctat gtattcatca ttcatctctc 
ctgtggaatc ttgcttgcac ttggacagct 
cccatagaaa catctgggcc ctctcaatat 
agcctccttt tctagggacc ctatcttgtg 
ttatcttctg ccttctcagg ggagctaaac 
tccacaatag gaaagtatct gcagagaaaa 
gcatttccct tctctcaaag attgtaagtt 
gatttctaca aattgttttt ccagttttat 
accagttcct ccatcatggc cagaggtaca 
actggggcat gtcacttcct gccccaaacc 
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ccttggtagc tttccattgc tcttagaata actttgtgat ctacaacatc ttcttcaaqq 
68 94 0 ■ 

ccccgcatga tacaaattct ggctatttct ctagtttctt attgcaccac cttqtccctc 
69000 

atccaccttt tttttagtct tctctctttc tttgaacttc taccaccagg ttttttcaca 
69060 

cgttcttctt tccccattaa caatgatcca ccattctctt tctttatcca ctgttactca 
69120 

tcctcataac tgaaacatca tttcctaagg atggccattc ctggttcagt cagtctatat 
69180 

ttcatccccc atcacatact cttgttttac cctatatttt tccttcaaaq cacttattta 
69240 

agttgtaatt atgtgttgtt tattttatgt ctgtctgccc tcacagaatc cacagtccaq 
69300 

69360 Ca9aa atCCtgcctc ttttat ttat accacatcca cagtattatt agtgcctgtc 

acctagtagg tatgcagtat gtacctattg aataaatgaa ttgacttctg tcttttagat 
69420 

cgtctactca ttttatcatt gatgacaaac ataatacctt acattcgtgt agtctttttc 
69480 

actcctcaaa gaggattttc tgcatagctc ctctgagcct cacaaaaccc tttaaggaag 

69540 * 

69600 9aata ttatcagata aa 9attgtga gacacagaaa agccagatga tttggcaatg 

ctcatagtac cagaggcaga aatacagcta gaacagtctc ctggcctcta atcaggagtt 
69660 

ctttccagaa cactgcttca tcttccattc tcttgggttc tttctatcct tactttataq 

69720 * 

ggcaaaatgt gtgcaaagta taatccctct tttgcaatgt gtttttagtt tttcagattg 
69780 

gaatcatgta ggctttttat gcccttaata aatatcagtg agcacaaagg aagtcctgtg 
6984 0 

agggcttata atcattttgc tcccattaat tccaacactg agcagtttcc ccatttccat 
69900 

69960 9CCtt gtgaa9ctct ttgctatccc tgttaaaatc taaagttgct tgaaccttct 

tattgcaaaa atgcatctta aacattctaa tacctctttt ttaaaaaacc aataaaqact 
70020 

acgtcaaaaa tcagccatca atcgagaagc cctgcagtca tttgtgtgct gttgtcccta 
7 0080 

70140 aa9t9 aatgtgctga gctctgcatt ccccacctag ctcctctgtg atcagggtgg 

70200 CCCa9 gacaactggg cc S a ggctgg aaacaccatc tgaatgtctg accacacaaa 

gttgagtggc tgatccaggt ttaaccttga cctcatcagc accaccttct aaqcaacact 
70260 

ttggctcaga agcccagtta tttattccaa gggatgattg aatgcagtgc tagtqtttct 
70320 

tcagggcttt tgaactcatt tatttatcca gtcatttata aaagatgaag aggagaacaa 
70380 

ggtaggccaa agtggctttg tactattaaa ggctgcttga tttctaagta catgttcttt 
70440 

70500 CtttC tgccattcca catt <=tagaa gccatgggta agtcagcaca gggatcttaa 
70560 taaCa ttggttttag 9 a 9gtctcgt gcataatgga ccagacttag agcacaatgc 

tgtaaggtag tgatttaggt gagcagcaga ttctggcttt aggagtttat tatcaqatgc 
70620 

70680 aaa ° 9 acttgtggcc ca 9g a tccct gcacccatgg gaagcattgt agccttagaa 
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ctctgggaat • tctgaatata attcctgaat caatcgtaag gatgcatatc tgatgcttag 
70740 

tgcaaaccaa gaggcagaat atttgcaggc agtgtatcct tgaaaaacaa atctaggtca 
70800 

ttttcctgcc atgcttcaag cttacttttc catccttcct gatggtagta ctaactacat 
70860 

ttgtagacca tttacgtggt caacactgtg ctaagctgtt agcttcattc tctatgagac- 
70920 

aggcactctt agcccaactt tacaattggg aaaactgaga ctcaatgaga taaagtaaat 
70980 

tctttacagt cattatgcta gtccatgaag gagctgcgat ttgcaactaa atctatctga 
71040 

ttccacagtc tttgctttta accagaggtt agcaaactac ttctgtaaag ggaagacagt 
71100 

agttatctta atctttgtgg gcaacatagg gtctctgtaa cgtattcttc tttctgtcac 
71160 

aatcttctgg aatgtaaaaa acatttaaaa tttacaaacc ttacaagaac agctcatggg 
71220 

ctaaatcgga cctggattta gtctgtgaat catagtttgc tgaccccgct ttttaaccag 
71280 

tatgtaccct ccttctcggg atgtgaaaaa ttagtgcaat tgcaatggaa aatagcaaga 
71340 

aaatggtaag ggcctggaag aggcagcagg attacatcag gtgctatccc tgctctggtg 
71400 

agatgaaact ggggatcatt gaaccacctg gcatttgtta aagagttctg ctttccctct 
71460 

gagattcttt caggaacctc acacctctag cagcccggag aaccgtgggc tgcaaggaaa 
71520 

tgcctcctca aaggagtaga aaacctgcag gatagaaatc atcacatctg • tctggctttt 
71580 

ctcaaccttt ctcttctgca ctttcttgga tataatcaaa gcactaccag gaactccaga 
71640 

gtcggcacct tttcattttt gtgttttcat ttaattattt ctcagctgct aagtgtttga 
71700 

ctgtttaagg gactctagtg gtaaatattt gtctttagcc tggcagaagc tgtggtttcc 
717 60 

tttgatgagc tcacacggtg tggcttttaa gatgctgctg accaggacag : ctgactgtcc 
71820 

ccagtgggtg cagtccccag cagtgggctg gaccccttcc agaaagcgct gctgggccaa 
71880 

gaggcttcct ccaacttccc gctgccccca tctaaccaac acctcagtct cttctccacc 
71940 

tgcttccctg ccctcttcct ttccctcgca gacactttct tctgcctggc aaaaggaatc 
72000 

ttgtttccat ggaagcctca ttaaatctgc atcttgctca gtttgggttt gatcacggct 
72060 

gccagaagta tttttagccc atgcagttgc gtaatgagat agagattggg gaaaggggga 
72120 

ggtgactgta taggcagagg gtttttttaa aaaaaagtga gaaagagaag gaaaacctct 
72180 

aaagaaaaga gttttatgga attggaagaa ggatggagca cctcttttgg gagcatgagg 
72240 

ctggtgttct ctggttagct cttcccactg gaagcccatg gacacttgcc ataatacctg 
72300 

tcctggtcac atgtcagggg aacctctgat ctccctttcc atgagcttag ttggcccagc 
72360 

cagggtgaca cttatgctag ggagtgtgat tgatgttgct gcttacagat ttcccctccc 
72420 

acagacctga tggggcagcc aggatagtgg cagagaagaa gacagagcaa tagcaggaaa 
72480 
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7254C> gaCaa CaCtaacaca ttggaggttt atgttcaaag acgggatcta gggggtcaga 

gaaagcacac ctaccatgta attggtgctg gaatctgatg ccaagtgcac ccttgqcttc 
72600 

tgaggttctg agaactcttg cttgtgcttt tcagccagac tatgccctca cctgcccctq 
72660 ' y 

727^Q taaag agCtCtttag g ct ggsgtgg ttgtttgcat tggattgttg gagtgtgtgt 
g^tgttgtt gtgttcttgt attacaagac aaagagatta aaaaaaaacc acatgcagct 
7^840 a9Cta atgtttattg aact tttact atgccacatg gtgttttaag cattctatat 

ctgttaactc attttcccta attctatgga ctagacactt aaacagtctc cattgtacaa 
72900 " 

acaaggaaac tgaggcacag agaggttggg aaactcattt gaggtcctcc agctaattaa 
7 2 960 

tagtggagcc aggttttgta cccagacaac ctgatttgag aatctgcagt cctagattag 
73020 

taacgtgttg ttggcctgtc acacatttta aatgacattc tgtacacaga accatttata 
73080 

gtaactttgt attgttgagc tgaaagcagt ctgcagatgt gctgctggga tttcattcat 
7314 0 

cttcaaagag gtgttttttt ttttttttaa aggaaaatgc ttttctgagg gtggtatcta 
7326C> ataaa aatctttacg atcaagattt tcacaaattt cattctgact ctgttgcatt 

gcccttcttc ccatattccc agttagtttg tattgattgc tgcatctccc ttgagcccat 
73320 

ggtcccccac aacatttctt gcagaactgt gtcctgcctt cacactgtca ggcaqcagqa 
73380 ~ " ^ y 

gcctctctag cggccagccc acagtc'ctgc agctccttcc tcaggacgtt taatttccca 

734 40 

^tttctatg cagttacctc acagaaggat ggctacgagg gcctcacttg gcttggcaag 

ttggtcccct ttttactcac aagactctgt ttatctcttt gtttatcttt gtttatctct 

735 60 

ttgttgacct gcccctcttc aaggcctcag ttttctctga agtttacagc ttccctcctc 
73620 

atcccgcaaa agaccaaagt ggaaaagatg aaaccagaat ccactgcaag ccccacctqc 
73680 * 

73740 CCtCt CCtctaaatg cattctctgt tgtgtttagg acttgagaat gaagagggac 
73800 tt9a9 gatttgttta ttattcttta caatatccct gtgagctgag tactgtaaat 

acccccattt gatacatgag taaactgagg tgtggagtga tagaggaatt tgctcaaggt 
738 60 

73920 aaCta gtaagtgggt gg a g c tgtga tgtgaaactg ggcagtctga ttctgggacc 

tgtgctctta atcaccaatc tatattgcct cctacttgaa aacatccagg gaaaatqttq 
73980 

74040 9atCa gctgaaatct tcttgcacag taaagcaggg gccacctgtc ctggagttac 

attcatcttg ttcattgtca acgatttgtg ttcagtgaca ccctcttcag cccaaqaact 
74100 * 

tacctgggtg ctgtgacaat tggacatgac taggaacaac cagtgacatt gtagcccatc 
7 4160 

74220 aCa9g gtaggaagtg gatgcttgtc actctctttt ggttataaga agcaggaacc 
"ISO 33 " 0 accttttata tatctataaa gttgaatata taagatatat gggggccagg 
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cacagtggct cacacctgta atccgaacat 
74340 

aggtcaggag ttcaagacca gcctgaccaa 
74400 

caaaaattag ctgggcgtgg tggcacacac 
74460 

caggatactt gcttgaaccc gggaggtgga 
74520 

actccagcct gggtgacaga gcgagattcc 
74 580 

gggtatgtgt agaactcaca gaagggcaaa 
74640 

tgggaagttg ccagtacctt cctgtctttt 
74700 

tctctctgca cctctgtctc atttccctct 
74760 

cacacataga aaaatcagga tccaccctcc 
74820 

atagtatcct taaaactcca cattccaggg 
74880 

ggcagcgaaa tggagtaaag atgcaactgc 
74940 

ggaagccgct gttgtggtcc aagtgacaat 
75000 

gcttttcaca gatggagaaa ctgaggcaca 
75060 

gcctttgtgt caatgctagg tcactggatg 
75120 

tctctacttt gtcccagagc ccactcagag 
75180 

cttaggatag tctcaggcca acagccccag 
75240 

gaaagctctt tgctgatctg tcaatggcac 
75300 

atttgaatac tcatctccaa aacctgtgtt 
75360 

ctcacaacgt ctctatgagg tgagcgcttg 
75420 

gctcagaggg taagcagcta gttcaaggtt 
75480 

ctgtgtctga ttcctaaggg cctgtgcttt 
75540 

tcatttcctc atctgtcaga tgggaacgat 
75600 

gaaaaacaga taaaagaaga gaaagtgctt 
75660 

attatttttg ctgctgacac acatcctcag 
75720 

tcagagctct ggaggcacct cctccccatc 
75780 

tgagcaggct gacaaactga accccactcc 
75840 

atagctcatg ctgtgttctg tcttgttgct 
75900 

cagcaggttg gggagaaaag gaaatgtatt 
75960 

ttcatttgtg gaatcagatt gtgcatgcaa 
7 6020 

gatgggaact gttggtttta taattgaagg 
76080 



tttgggagcc 


caaagcaggt 


ggatcacctg 


catggtgaaa 


ccccatcttt 


actaaaaata 


ctgtagtccc 


agctacttgg 


gaggctgagg 


ggttgcagtg 


agcagagatt 


gcgccactgc- 


acctcaacga 


aaaaaaaaaa 


gaagatatat 


caggccttaa 


caggtgctga 


aaacaggaac 


cccctggaac 


caaacggttt 


cttacttgct 


ctcttcagat 


gatttttcat 


tgttgcatca 


caagtttaca 


tatcgttgtt 


tcaggcagcc 


agaaagcttg 


ggtcaaggat 


tcagccaaag 


caggtctatg 


ggcagcaagg 


aggccgggaa 


tcaacagctc 


aaagcataag 


taagttgtgt 


gaaggaacct 


ggctggggtc 


caggtctctg 


tggcgtctga 


tttctacagg 


aaatgtggtt 


cactggctgg 


ccagggggtc 


ctagggccct 


gacagaagca 


accaaagtga 


agttatgaaa 


ccttgtagag 


ccaatactta 


gaacacctgg 


ctttctacca 


cgtgacaagc 


ccttgtaaac 


cagatccaca 


ctttagataa 


gcaaatggag 


atgcacctga 


gccaggatgt 


ggacacagct 


agccactttg 


caatactgct 


gctgtctgct 


aatactcaac 


tcacatggat 


actgtatgag 


tgaaaacata 


agcagccctg 


gcagatggga 


ccttgagggc 


tctgctgagc 


catacccagc 


aacagcaggg 


gggacattct 


gtcttcatcc 


tccctcaatg 


tccccatgct 


gggaaggagt 


oaaaaaatflT 

y ci y ci y a a i— y 


Q y Q a ^» ci y ci 




tcttccccca 


agatttcttt 


ttgaaatatt 


gtttcttcca 


gaaatgtaag 


acgtcgtaat 


atgggaaagg 


aaactgatat 


ttatggagca 



46 



) 0233O86A2 I > 



WO 02/33086 



PCT/US01/32152 



cctgttctat 
76140 

tattatccac 
76200 

cccacaggta 
76260 

taccactgcc 
76320 

tctagcagac 
76380 

gagtcagaac 
7 6440 

agcagtcaac 
76500 

tgcacaaact 
76560 

aactcatgag 
76620 

ggggacacag 
76680 

cagaatggga 
76740 

gaaagcagac 
76800 

cagttcttga 
76860 

cctgaagtgg 
76920 

caacaacaaa 
76980 

gagacaggtg 
77040 

ctgctttccc 
77100 

caccaaatac 
77160 

gagtatgcat 
77220 

atcattcttg 
77280 

ccagaataaa 
77340 

cagctctaag 
77400 

caattcctga 
77460 

gtttctgaag 
77520 

ataagatggt 
77580 

gaattcactt 
77640 

agttccaggg 
77700 

ttaggccatc 
77760 

gcagcactca 
77820 

aggccattca 
77880 



accaggcagc 
atttcacagt 
ggtggctgca 
tcccattggt 
taggaagctg 
tagatggggc 
attgggtctt 

agtgggagcc 

ctagggcagg 
gctccatttt 
agtgtcaaga 
atcagagttg 
agcagtgacc 
cctctgtgtt 
gtgttgccgt 
ttactctctc 
aggaccacac 
cctgtgcacc 
gctatggtct 
gttagataag 
ctcagaccca 
gtataagcct 
ttccacatta 
cccatattaa 
ccatattatt 
tggagaaatt 
atgatgaaaa 
ttttttctct 
gacccacatc 
ttctgtggga 



tacccaacca 
ctgagtctga 
gcgctgggat 

ggggaggcca 

ctggaagcta 
tcccctggcc 
atgcagagcc 
tgtgagggaa 
taccagaggg 
catgaccaaa 
agctttgttt 
ggaaggctca 
ctgccttatg 
tagaagaggg 
atgtgttgtc 
cacttggcaa 
agctggtaaa 
acgcaaacac 
ctctcatttt 
aaatagccaa 
aagagatgct 
tgtccatctg 
caacaaatac 
atgccaaaat 
ttagaggatt 
gggatttttt 
ccaggaatgc 
caaactggtt 
ttcagtttaa 
gagataggct 



tcagccattg 
ctcagcaatg 
ttgaacccat 
tggactggct 
cttatgcaaa 
acttttccat 
acctgtgttc 
cagcctgtcg 
tgtgttccac 
gactgagcag 
gacaattgag 
ccccagctcc 
tcatgtgttt 
cctgacatga 
tcatcccctc 
atagggaaaa 
cagtgtccat 
aaagacaact 
gtctttcagg 
cacttctgca 
cagaatgtgt 
tcacattatg 
aatttcaggc 
ctgagtcagc 
ctttggttgc 
tggcttgcat 
aaatgtctcc 
ccctccactg 
atgttggaaa 
atgtaaaaag 
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ttgcaatgtt 
ttgtgttcta 
ctccaaagcc 
gtcagagatg 
ggtcagcaag 
gctggcccac 
aatggaaaca 
ggttcattga 
ccaaatgggg 
agaggctctc 
tcaagaggac 
ttgacaaagg 
atcaaagccg 
ttctctgaga 
aatagtcctg 
gagggcccag 
ctcagctgtt 
ggacaaccaa 
gctataccct 
gcatggtagg 
ggagttaata 
acatgtgctt 
tttgaactaa 
tactggaggt 
aaagggcaga 
aagcaaagca 
agaattcttt 
ggctggagac 
tggactgtca 
atagccactc 



atgcaagctt 
tgtgctagtt 
tccatctttc 
tcctttccag 
gaaggaaaca 
atgtccggct 
tcctggacac 
ggttcagccc 
caggtaggca 
tgagcagtgg 
agaaaagaca 
tgcatgaggc 
gcccatcagc 
aaggatttga 
tgaggtatgt 
agaagtgaag 
ctgtctccca 
gtcatctaat 
aggagagcta 
ccaaatacca 
cttcactata 
gctcccacct 
caatgccaat 
agagacatga 
cacccagctt 
tgagaaagaa 
cttttttctt 
gttactacca 
gagaacattt 
ccatgtgaac 
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aatgtggtta ggattagagg catgaatata ccccaaacca ggggtgtggg aaggaggttg 
77940 

acactctagg tgataatacc cagaccttaa ggagctttct gtctagaggg aggtatggac 
78000 

atggacaagt aatcaacagc tacaaagcag agctgccagc tctgcaacac aagagccctg 
78060 ~ " ^ 

agaggcatga caggggcagg gtggggatcc atgtgggtct ggattgaagt gaggaggggc- 
7S120 

atcaggaaag cattccagga gagctgaggg acacttgagc acaccctcaa agaatgactg 
78180 

ggggtcatga ggtatacaag ggaggaagtg cacccgagac agaaacaatc acataagcaa 
78240 

aaatgcagaa gaatatgagg atcggggaag ggcaagtagc tcagtagtgt tggaggccaa 
7Q300 

gggacacgaa ggaaggtgat aaagccctga tgttaaggat agaaaaatca aagtcctttg 
78360 

aaaatcatgt ggagttagga tctcaagaac cctacaagga tttctttaga atagaatcaa 
78420 

agaaaaacaa agtttacagt ctgtgagggt tgcataggaa gtaacgtggt gagaaatgtt 
78480 

ggcttgagaa ccacatatcc ataacacaat ggtgttttag aggatttggg ggaagggaga 
78540 

gaaaatctca aattgtctca gtaactaatg agctttcatg tacatttaaa atagtaataa 
78600 

atgcaattgt gaggatgatg gtgagatgag caaaataatc cagtttgtaa ttgtagttat 
78660 

caggctggca tatcctgcag gtcacacttc taaacatgac ttcgaaaaat caaagatcag 
78720 

ctaagtttga agtaagtatt gaaagaggga gattatgttg cctcaagtta aaatagaacg 
78780 

taaaagatgg tgattcaaat gatcaaaagc accaagcttc cctgttagga ttcaagggag 
78840 

gggtgcgtgg ctccgacacc agatatctgc aaagcaatat gaaatgagat caatagtaga 
78900 

cattgaaaga ttgaaactga tataggatat tcaagtacca gcttcaagaa aatgaaatga 
78960 

gacctaataa aagagagtag gagtcaaggg ggtatacgat attaaagaaa gtgaagagcc 
79020 

agggtttgta ggaaggaagg gagaagaggc aaagagagca gctcttttaa cacaggagct 
7 9080 

tcctcctttc ccattctccc tcctgctaaa agccgagttt gttttagctg aaatgattgt 
79140 

aagacaaatt tttattatta aaaaaggagc tattttgtgt tggtttccat tataaaatca 
79200 

gagctctgct gccataaaat taaatcccat aataaaatga gtagaaaacg tgatgtcctg 
79260 

cagaaaggaa gatggcagcc cactcagtgc catgctgggc ttgactatat acaagccgtg 
79320 

catctcctgc tgcgagttgt agctgctgcc cagcagtgca cattatcgtt gcagctgttt 
79380 

tcctcacatt ctgaggttta tgaaatccct catccatcaa taattgatct ttagctctta 
79440 

gtccaggggt tgtcaactgg cactccatgg acctttagag gattgatggc taggttttca 
79500 

aagatctttg aaccccctga aattatatac aaaatactgt gtgtgagtat gtgcattttt 
795 60 

ctggtaagaa gcacctgaat tatcgaagca gtttgtgatc ccccaaaaag ctaagaacta 
79620 

cttcctagag caaagggaga ttttgctaca cttagagatt tacacatttg accagggcag 
79680 
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ctcacacaag tgggatgcgg tttcacattt 
79740 

tcatcaagga aatattgtaa tacttctata 
79800 

ttggaataag aaccactata tatatacaag 
79860 

ggcaggaagg aatatgatta ga.ttcatgaa 
79920 

agataattgt gactttacta aatcaaacct 
79980 

tttggggttc taggggtaaa atgtgacccc 
80040 

tgtgaacggg cactgagctt ctgaaattaa 
80100 

gtgatagttt aagtactgat aattattcac 
80160 

cagtattcct taatgggaaa tcaacttaga 
80220 

ttctggtaaa acatgcataa atcaatggtc 
80280 

tcggggaaaa cagtagaagg caattgaaaa 
80340 

caaaagtcac aatggcttga ttgtgtgatg 
80400 

tgttcaaaaa aaagacagaa agggtagtta 
804 60 

aatcagaaac aatgaaaata attatttcaa 
80520 

ctgaattacc tggataattg gtaaaatttc 
80580 

tctctctata aataaataga aatgtatgaa 
80640 

aattatcttt tgggggaaaa ttggtagaag 
80700 ' ~ 

aaaacagcaa acggaagtca caatggctcg 
807 60 

aaacacatca tctacacaga tggattcaaa 
80820 

tgtgtttcct ggctctgttc cctcatatct 
80880 

gaaagtgtca ttttcaaaac ctgcttcttc 
8094 0 

tccacaatcc ccacctcttt cacccagtag 
81000 

tacttcccct tcttatattc aaaatgtttc 
81060 

agcatttatg agctcctgtt atggtttgga 
81120 

aaacaagtaa tttcaattga gtatttagta 
81180 

gctgtgggag gctccccaaa tcagtctaag 
81240 

gtgtctgaga tggggaggat gtgtggaact 
81300 

tagaaaaagg cctctttcag catgctaaga 
81360 

aatttggagg gaaggaagag catactgagq 
81420 

gttgggtgag tgaagttgag tgggtacaaa 
81480 



catggcagat ctgcttccag ctatacaaat 
tgaatcagga attcactata tttaacttat 
tttttccaaa agactgaagg ttcttcctgt 
gcgcctttat gtttatattt caactctgaa 
gtataccacg attaggaaaa tgtggactga 
tgtgaagtac caatgcaccg ttcttttatc 
ttagtaggca ggaggacatg cgcatatgac 
ttggaaggga agagaataaa attcagaaca 
ggaggtagga gggagatcaa gcaagaatat 
agccaatgtg ttgatcaaag aaattatctt 
acaagcatca ggctgcataa aaacagcaaa 
aggtaattaa tggctgcagt tagcaaaata 
caggagaaaa acatccccgc agatcttcaa 
aaattaagaa aaaaactctc taatttatac 
ctgcatatac aaatcttggt cctctgctcc 
tcaatagtca gccaatgtgt tgatcaaaga 
ccaattaaaa aacaagcatc atattgcatg 
acggtgtaat gaagccacac aatatgtatt 
gataccttct ttgtgtctaa gtcccaaatc 
agtcattctc caagtcagca tgcccaactt 
tcttctggaa gttcttcctc tgcccattgc 
caaaccttaa atttatcttt tactttgtct 
tcacttgcat ctcttttcat tcatttcata 
aactgttctt catgctggag gtggtcttat 
tgttaagtgc catcccaaag gcaaacacca 
gaagttggga aaagcatctc agagaagatg 
gggcaaggaa gagaacaagt aacaacattc 
agtttggagg acagaggagt taccattcaa 
tttgccactt gaacagataa tttcagctgt 
tcaggtcagg aatataagtt aggagactgt 
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tactagaatc caggccagag gtgatggtgg 
81540 

aaaaaagaaa atgtgttcat gaggtagaag 
81600 

ttggaaatgg gggtagcctg gagaggaagc 
81660 

catccactga catagggatt aaaggaggag 
8172Q 

cagctatgga atgtttgaat ttgatttctc 
81780 

gctcagggtg tagacttgag agtggatggg 
81840 

aaaggtcaag gaactgaggg ccaaggattt 
81900 

cgcaggatgc tggcaggaaa cctatgagcc 
81960 

gtgtcagaag cagcaggaga agcattgata 
82020 

gtagcagaga gagttatcct agctgacaca 
82080 

ttaggaaaga aagagcagta tccacaggag 
82140 

tacctagatg aattctattc tcaagggact 
82200 

cagagcaaca ccaaacagat gtatatctca 
82260 

ccacatgggc ctacccactg tctgctaaat 
82320 

agtattcata atcttctttt cctaatctct 
82380 

tcaatgacct gctccatccc agccctttca 
82440 

ctctccattg attacacact tcctgtagca 
82500 

tgtacacata tatttcaatc tcttggtgga 
82560 

gtagcacatt tccttgcata. tgtagatgca 
82620 

tttcctgttt aacatgggct gcaaagttct 
82680 

tgacggatgg tgcatggagc ttaaattcca 
82740 

aatttgtttg cctgtacagc tcttacagca 
82800 

agctaaaatt gtaaatggca gctcaaatga 
82860 

atatttgttg gagagtaact gcttaagaca 
82920 

cttttttatg gcagagcatt aaggaaaaaa 
82980 

tgtctgaacc tgctggacac aagtcccggt 
83040 

ctgggccatc aatagataga taaggtacat 
83100 

gataaatgga agttgcacag cagcctaaaa 
83160 

gcgccatctg cctgtgtctt cctgttcttc 
83220 

atttttatga tggttatcct aaagatatgc 
83280 



ccaatatatg 


agagttttag 


cagggaatga 


taggtaaaaa 


caacaggatc 


tggttcctga 


cagaatgcag 


gcaagaatgc 


atagtggtac 


aagaagcttt 


ggtaaagaaa 


ataagaagtt 


tgatgaggag 


tagttctagg 


tgatgataat 


taaagtaaag 


gttgaggcta 


ttaaaaggga 


ataataagtt 


atcttgggcc 


actaaagcca 


aggtcttcaa 


tgttgagtcc 


agtgactcag 


gcctgatggg 


gaaggagccg 


ttacctgaga 


gctctcaggg 


atttgcttct 


aaagcaatcc 


actggtgggc 


actggcttcc 


ccagaaaacc 


cctatttaga 


taaggggctt 


tgttagttct 


ttacttgccc 


ccacaacctt 


tctgctctgg 


gcacttcata 


ttttcttgtt 


tcagftgcctc 


gcccctcact 


tacctgaatc 


ttttgtattc 


agaaccttta 


atacctacca 


agtgaatact 


cctgttctat 


aattatgaaa 


tattacctat 


cagagaatcc 


aatttatgcc 


ttgtcaattt 


ccatgaatat 


ttagagaact 


tgttagttaa 


ggtccatgca 


cgtcttttat 


aaaatagaaa 


tgaagcagaa 


acatatgaga 


gatqgaqctq 


attgcttcca 


atttgtttga 


tttacctaag 


tttttctgta 


cattcagaaa 


atgagtttga 


tgaaaaaggg 


ggagattata 


gcttttaact 


aagtgcagat 


aaatgagatc 


aaatggcaag 


agccattgat 


agacagtgtt 


tatatgactt 


cagcggccaa 


tgttccagga 


agtttgagaa 


gcttccttag 


gagggctgtg 


ctcctccaga 


ttcttcacat 


taaatgcttt 


tccttttctc 


tagcctggac 


tttgacaagg 


acatctggag 
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ataagaaaga ttctgaatta 
83340 

tagatggcta gagattcttg 
83400 

gtggtaccta tagaagctga 
83460 

aacttcacgg ggctcctcta 
83520 

gcattgactc tgggtagtag 
83580 

gatgcaaaca cattcccaga 
83640 

ggagatactt gtcctattgt 
83700 

agatatcagg tcaaatgaag 
837 60 

aaattggcca agtgcagtgg 
83820 

atcaggagtt cgagaccagc 
83880 

taaaaaatag ccgagtgtgg 
83940 

ccggagaatc acttgaacct 
84000 

actccagcct gggcaacaga 
84060 

aaagaaaaaa gaaaagaaag 
84120 

tttgggcttg gagtccaact 
84180 

caactttgga ttttactgaa 
84240 

aatgctagta cctgtctcat 
84300 

tagcatagta tgtggcacat 
84360 

agagactcgg aggtgatgag 
84420 

tattatctca tttgaaaggc 
84480 

acctcctcag ttcactttct 
84540 

ctcagttttt atcatttttg 
84600 

agatttgatg agttaataca 
84660 

catgtaagta ttagctgtta 
84720 

tgaagtcttt gaggctcttc 
84780 

tacatgtctg ctttccccat 
84840 

tagcgttggc atcgtattat 
84900 

gttgtaggga ttcagtcaat 
84960 

aagagaaggg ccttggggca 
85020 

atccaggaga aatcaggtca 
85080 



tttttccctt tgggcaattg 
agaatatttc ttttcttgga 
catcagcagc agcctgcctc 
ctgacagccc catcggtttc 
agggtggttt atggaatctg 
gcagaaggct tggcatgtct 
tcttgagatt ccagcaaaaa 
gctttggtgc tacaacattg 
ctcagcactt tgggaggctg 
ctggccaaca tggcgaaact 
tggcgggctc ctgtaatccc 
gggaggcgga ggttgcagtg 
gtgagactcc atctcaaaaa 
aaattaaatt aaaaaaattg 
aagctaggct ggaatcccgg 
tctctcttat tctcaattcc 
caagttgaag gagacttaaa 
aaagaacact caataaatgc 
agaggccaca attccctcca 
agtatagtat agtggttaag 
gtctctatca tttattagct 
agagaggagt aataatagtt 
tataaagcac acatagtagt 
ttttattaaa caacatgtgg 
ccagtttcat aaatcagaga 
tggactgcga aatcttgaaa 
gatacctgac atttaccagg 
gtgggtcatt gcaggcgggg 
tcacattaag tagttaccag 
atatttcacc ttcatggcaa 
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tagcaatttt aaaactatgt 
aaatcataag gctttggata 
cagtcgatca gggcctttgg 
cctccagcac acgtaactca 
attcatctca gaaagaggtg 
ggtcttaggc agagggaact 
tagcccatta cagaggaaga 
tcttagaaaa aaaaagaaag 
a ggggggcag accacttgag 
ccgtctctac caaaaagtat 
agctactcgg gaggctgagg 
agccaagatc gtgccattgc 
aaaaaaaaaa gaaaaaagaa 
ttttttaaac aaaggaaggc 
tttcatctcg cttctctgtg 
ctcctctgta aaatgaagat 
tgagatgtgt tgaaagcatt 
tggctataaa gaagccagag 
tttcattgaa aagcaatttt 
gacatgcact atggagctag 
gtgacttaac cttcttgtgc 
cctactctgg tgtgttgtgg 
gcctggagca tattaaatga 
cataggacat attggaactt 
ctacagtata aatatctgct 
ctgttttatt catctctgca 
tgccaaatgg gactgggcat 
aggtgggtcg ggttaaaggt 
attgaactgc aaacattgct 
taccagtaca gtccaaggag 
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aatgcataga aggaaagaaa tcataatctg 
85140 

aataattatt actattccta tacaattttg 
85200 

gaaaaatggg gtgctaatct attccccttc 
85260 

agatagagaa gctataggtt atgaatttgg 
85320 

atgtttgtcc atgtcatgtg aagctcttaa 
85380 

aacttcttta ccccctaatg cctatcacat 
85440 

gtaaatggaa tttaatttct gaggtccagt 
85500 

cagtttgtct aaccatgtct tcccttccat 
85560 

ttccatcaca ttcttgacac ctcccatgtc 
85620 

gaattctagg aatct^naja cctagagctc 
85680 

gagacagttt cttaatttct cact ccaact 
85740 

ccaagtctcc atto-jgcit a tag.aattcc 
85800 

gccatgctcc aaaaca;ca: dttrrctcta 
85860 

tactttgaaa atgtgt jCdt gtqaqaaaaa 
85920 

acaagtttgc tactttaqaa acttgaciaa 
85980 

aaaatgtgtt cagcttggct qaataatgac 
86040 

gctcacagga tttctatcct caagccctag 
86100 

tcttaatgga gggtgtgcta aagaagcaac 
86160 

aaagttggtg aaatagagaa gatgcctaag 
86220 

taaccaaggt gatacaatgt tatgcaggac 
8 6280 

atcaggagca ttgtcaagaa tcacaacaac 
86340 

taagtaattt gcaggagggc atctcactta 
86400 

agaggcttaa aaaggtgctt ttcccaatgt 
86460 

aaacataggt caacctgaca acaagatctt 
86520 

cccttgatcc atggaaatgg accatcttca 
86580 

ggtatggatt gggtaatgct caaagacaga 
86640 

ctcattttct gaagacatga gcctaagctg 
86700 

gggcagattt tttgacactg ctgggtctca 
86760 

taactgcata gacttcactg ggctgttaag 
86820 

tgcctagtgc acatgcaaag gattattggg 
86880 



attgtatgtg tttttttagt agtaaataat 
tgtgttggtg tgttttgttt tgttgtgcat 
ccaacaccag tgctcagaag aaatttccac 
ccttgatgga ttctgggtca ctatttctca • 
gataaagaac aatgtcttac tcgtcttttt 
actttgccca tggaaactca atagacattt 
aaagcctttt tccatccttc ccctactaca 
catccacctt ataaacgtta ttactcattc 
caatgtcaaa caagtaccat ttgggaaaca 
ttcagaccct gaaatccagt tttctgagct 
ccgtttctcc tctttctcaa tggatatttt 
agaaaacatt caattttccc ttctcttaat 
gacattgagc attggagaga gatggaaaag 
tgctaagtgt tctgtctggt cacttcaatg 
acagagtgtg aggaaaaaca tgaaaagaaa 
cagcagggtg aaaagataag ataaccaccc 
aaggttgaca acagcagaca ctgaaactac 
attatagccg cttttaggaa agcaaatagg 
catgtgagat accacctcca tcttggaaaa 
cccttaatta aaacagattt agtgattaat 
agcaattagt tactattgag caatttctgc 
attatcacat ccttttatag atgagaatat 
tattcagcta taagtggtca gtcatgactc 
cactcttaac ttctcttctg tgttgtaata 
tatactgctt ttttgcctct ggaatgtcca 
gaggaataga gtattaaaaa gatccctggc 
agctgtacca tttaccatct atgtgaactt 
attcctgtaa ctgtcaagtg gaagtgagcc 
agaataaaat gaaataactg taaacagaag 
gctttctacc cttcagggat tagaagttga 
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tagtaggcaa caagttataa gaaatacagt 
8 694 0 

ttgtctctgg aaaaaaggac ttgcctctct 
87000 

tgcagctgaa tccccaacag gattttgttt 
87060 

acccccaaaa ctaaccccat aatttcattt 
87120 

cctgcccaac ctgcagggtg gttgggaaac 
87180 

gtccactgta aagagctctc cggagatgat 
87240 

catggattct tggaaaaaca atcactctac 
87300 

gggagaggag ctaggtcagt ctccaaagtg 
87360 

aacttatgaa ggtttttaaa atcagaattg 
87 4 20 

tgtgaggtgt gactaggtca cgtctccttc 
87480 

tccctccttg accccaggaa aaattgccgg 
8754 0 

ttggtacaga tggggaagaa agaaatgcat 
87600 

cacagaaggc tggtgaattt gggctatcca 
87 660 

tggcctcctt caatttcatt ttccctttgg 
87720 

cctcagttgg ccaagcattg gagaaacaga 
87780 

ggcttacagg gcaaggtgag agaagacaga 
87840 

tacaaactag aaatgagacc aggttcggaa 

87 900 

gggacagttt cttcaggaaa tgggatctca 
87960 

aaatatgttt cattgttgtt gttgttattg 
88020 

ttttgtaaaa acaactgaac aaatgcaatc 
88080 

ttaggaatat aacccccttg gagacgttcc 
88140 

gcttcagcta agtagggtca cccccccaag 
88200 

tttttagaaa atcttatcaa ggatatttaa 
88260 

cggttttgga caagtatgca ggggtcttgg 
88320 

tgtcacttcc tagctgggtg accttggaca 
88380 

cctcatgggc aaaatagaaa tgctactagt 
884 4 0 

agaaggatta aatgtatgta aagcacagtg 

88 500 

gaggggagca tcctaaggtc ctccgaattt 
88560 

cccttcaaca gcaccttcag aatctccttc 
88 620 

tgaatgagaa attaggcatt ctttccctgc 
88 680 



caattgtctg ctgaccaggg ctagagttaa 
ttctcttctt cctccaaaac ttaagacgtt 
tcctttggga gagaggaaac agaccaatat 
cagcagtaaa gtgaggtcct tgataactgc 
tctgaatggt catgcatggg gaagcattgt 
aaatctcatc agaaggcttc atgcttgagg 
gtatgtggtc agaatctaaa ggagatgctg 
gaacagtaga aactaatcat gtggagccta 
gccaccttcc tttggaccat gagctcagat 
ctgcccctgt ttccctcctc tccctacctg 
gatatgaaag ttaattatga cccaagggaa 
tcaagagcat ttccatcagt attgaaatta 
ttcttgcctc cctctgtgcc cataattcct 
ttcagaggaa tgcttgatgg cttaagctag 
gaggtgtatg acacagctac actcccatgg 
agttgtatgt gctgggtgcc acgtggtagc 
gaggaagagg gcttgcagac ctgagtcatg 
gctctgcctt gtatgcaggg cttacataat 
ttgatttaat aagattttgt tttaagaaga 
tcctgccaga gcaggcagca gcaaaggaga 
ttcacctacc tggtgctgga ttacctaaaa 
aaattatttt aaaaaaattg aaatctgata 
ttggactatt tacacctatt tagggtcagt 
aatcagacca ctggggtcaa atcctagttc 
aagttacctg acttctaata gcttcagatt 
acttaatagt gctctgagaa ggattcaatg 
tttgcccata ggaagctgtt atttataagg 
aggagaacta aaaatcttac actgacttct 
atttttcata ctgttctttc aaccctttga 
agattttccc aaaccttctg ctttggccaa 
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taaacatatt tttagtccca atcttgcatg 
88740 

cccctcctgt gctcttgaat ccaataccct 
88800 

gtatctgctg ttaggcacaa tgatgacccc 
88860 

ctggaccttg atgacccaca tagcctagag 
88920 

agggcagttg gaaaacttca caagacaatc 
88980 

aggctgctaa aaatgagctc gaggaatcag 
89040 

attccatctc ccaattttct gagtcaagag 
89100 

ttatccaaat ttcctgggtg cttgtccaaa 
89160 

ctgggactgg gacttaggaa ttggcacttt 
8 9220 

tacttcaacc actaccctta tagaagtgct 
89280 

tggaaggagg ttttgtcgaa gggagattct 
89340 

ccgctttgtt tggttctatt tcaccaaagg 
89400 

ttaaataatt agttttggta accattggta 
89460 

ggtggcttga gttcaattct ctgctctgcc 
89520 

gccaacctat ctatgtcatt tcctcatgta 
89580 

gaggacccag tgaaatgatg tgtaaccatt 
89640 

ttttttacat cagcttggag cctaccatgt 
89700 

tccgtgcaca aataaaagga cttctctttt 
89760 

caatagaatc catgacaatc ctgggccatg 
89820 

gagtctggtc tgaacatctt ccaaactcca 
89880 

ccatttgctg tctcctgact caattgctag 
89940 

aaggctgtgt ctgggccagt gtagcgagca 
90000 

cttttcctgc gagcctttac taagcactgc 
90060 

tggaatgcac aggtaaatct ccttggagag 
90120 

caccgatcct caccgctgag ctccatcttt 
90180 

caccatcacc atagcaacac agtggtataa 
90240 

agtctgagta tcaactcttc cagcatggag 
90300 

atccagacga taactggcag ccaggagggt 
90360 

aggggaagga aagggaacca tcacttgctg 
90420 

cgcacgacaa cccaatgtga atgttatcat 
90480 



ctcctttggg 


acttttcatc 


tgataaacat 


tcttccctgc 


cctccaccca 


gagtctcctt 


accaaggtca 


gacaatggct 


gtggcctcac 


cccagagatc 


agccactgat 


ggaggcccag 


cagcctgatt 


gttttgacat 


gcctgacttc 


ataggaaaaa 


gagataggtg 


atgcaatttt 


ttgtttgttt 


aactccagtt 


aaattagtat 


gaaaagtacc 


ccagatctac 


aaattagaat 


tacaattata 


ccagatgttt 


ctaatatgag 


gcctaggacc 


ctctcttctg 


gcaggtgaag 


ccacttcaac 


ttgagtgtct 


tggcttgtat 


ctttcatctt 


cacataaatt 


ttcttcagct 


tactggaaag 


aacattagat 


ttggagtcca 


atttaccagc 


tgtgtgacat 


tgggcaagtt 


aagataatcc 


cacttcacca 


ggccactttt 


ttaggaacac 


tggatcattc 


tacagtgcaa 


aggcattcaa 


atccactgag 


tgtatggagc 


ctgcccgtgt 


acaactttgg 


tttccttaat 


gtataaagat 


gggactttct 


tcctgtgaag 


acataactga 


tgtcatttct 


ccacccaacc 


agaagccact 


taaggaaggt 


tcctggagtt 


gttttcaaca 


gtcagtcctc 


tttatcttct 


ctcctcctgt 


ctccttactg 


catctcctga 


taccagccag 


gaacagtcca 


cagccaaggc 


cctttcaagc 


tgtccttccc 


ctcccctccc 


aaaaatgaaa 


gcgctaaggc 


atctaaatat 


ccgaaaacct 


agggaatgac 


agctagaggc 


ggataagtca 


aaggaagggg 


tcaaggaaag 


agcctgctgc 


ctgtgctttc 


tcatgtcacc 


ctccaggtaa 


ctgctgaaga 


aacggaagct 
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caaagaggta agagatttgg 
9054 0 

actcaagttt ctatggcccc 
90600 

gcaagtgaga gattttgctg 
90660 

agcccagagc tcaactaaat 
90720 

gcagcacctg tgtttctcaa 
90780 

tttcccaggg tgcgtgatta 
90840 

gaacttacac cagtgcagaa 
90900 

tgataaatgc cattattttt 
90960 

tcactctgtc gcccaggctg 
91020 

tcctgggttc aagcaattct 
91080 

accaccacgc ctggctijait 
91140 

gggtttcacc atgt t tig j r<a 
91200 

tccacatccc aaagt i:ta: 
912 60 

gatattttta ttaagcaict 
91320 

aaatgcagct cctgcrrttg 
91380 

aattatacaa ttgattg^tt 
91440 

aatatcactt gcctgatqac 
91500 

caatcccctg ggccaattat 
91560 

agaagcttag cctcaaccat 
91620 

tggtgtatct tcatactaat 
91680 

acccaacaaa ctagactcaa 
91740 

agcctttggt atcactgaca 
91800 

gatagacaag gtaggtggtc 
918 60 

gtgggaagtg cctgtgaatt 
91920 

gcatttctgt ggatccaaag 
91980 

tgtatgacta agtgagaatc 
92040 

aaaaatgctg tctaaacaaa 
92100 

taatgctctg ctacctacct 
92160 

agatctggga aagtgttttg 
92220 

taaatgtgtt atattactgc 
92280 



ccaaggtcac acagctataa 
agaatttatg tgtttctctc 
gaagcactgg gctttttgag 
tattggaaga gactgggcca 
ggactgagga aaatgaaggg 
tatggcatgg gggtgggggc 
agggtgtgat tagaagccct 
tccctcattc attcaataga 
gagtgcagtg gcaccatctc 
tgtggtccag cttcctgagt 
tttttttttt tttttttttt 
ggrtggtctc gaactcctga 
gittacaggt gtgagctacc 
gatgtgtgct taactctgga 
taqagcttat tacjgatagtg 
ctttacaact gtaacatgta 
ttcagtgaaa gggaaataca 
aaaggtgatg cccactttca 
gggtttatat gattggtaga 
catgaaatcc atctaatagc 
atgaaatctg atgagggaat 
taaaactgaa tgtatgtgct 
cagcccatgg tactggcagc 
atgcaggagt gggaggggag 
ctttccaaac tacctgcaag 
agataatata atgtgtatgg 
tagtgctgat atcaaagata 
gccagctgtt tcccagggat 
aaatccttga ttaaaggccc 
cactattgtt atgctttctt 
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gcagtagaac taagatttta 
tccataccac agggacaggt 
caggccatat aaaaattctg 
aatataaggc ttctatctaa 
ggagggttgg caaggctgca 
cattatgatg cccggacatg 
aagccagaga atgttcagtg 
tttttttttt agatggagtc 
agctcacggt aacctctgcc 
agctgggatt acagatgtgc 
tgtatttttt agtagagaca 
ccccaagtga tccacccacc 
gtgcctagcc tcattcaaca 
aatatagggg tgattagaac 
gagaagacaa ataaggaaac 
ctataagtac ataacagaag 
gaagttctta caaatcaaag 
aggtggacag agactgtgct 
ccctgcagat ccattcccaa 
catacaagtg aggttttaaa 
ttatgatttg ttcttcctac 
gagggtgctt gtgtcttggt 
ttaaagtcag ccagccatca 
ggagtaggca gtaaagtaat 
tcagcaaata tgggggatgt 
agctctttag ttcttcagaa 
atgatacagt accctaattt 
gtggtaaaga tgaatgggca 
tccaggcaga tgtagaattt 
ttatcacccc agaatttcac 
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catctcctgt ttcaggtgaa cgagtctgcc 
92340 . 

aaggtagcag ccctgagatg tgccatataa 
92400 

ggccttcctg gctggctcct gtcagctggt 
92460 

aagcttattt tatttccatc aataggacag 
92520 

attggctaca agccatctgc ctctgaacag 
92580 

gatgtctt tt gcaagggaaa cagactgttt 
92640 

gtaaaggaac agcactcagg agaattcact 
92700 

agaaattggt actgggaccc aaaatcagat 
92760 

gggtgggggt tttatctgtc aataaaagca 
92620 

agttgccacc tctgcctact ggacagccac 
92880 

gccctccgga gtacatactg ataccattgg 
92940 

aacaggaccc tggaagccct gtccgtgttt 
93000 

tgccccacaa tccaccctgt gagagaccaa 
93060 

ccctgaaaag gcattcagct tccactgggg 
93120 

taaattccta agacaggaaa taaggaaaga 
93180 

tactcctttg tagttcttcc gcagaccaga 
93240 

tccaaaccag cacctgccaa acttctcacc 
93300 

aaaaaaaatc actttattgg ttgctccagt 
93360 

ttaagtgaca tgtacgactg cttattgtat 
93420 

cattatctca tttaatcctc tcaaaaaccc 
93480 

ttagattaag gaaactgagg ctcatagaga 
93540 

aattactgcc tccagatgtc tctgattcta 
93600 

aatgaataat gattttcatg gcatctgtca 
93660 

caaagtcata accaaataaa tgaacttgac 
93720 

attttcatgc atgttttgaa ggccaggaaa 
93780 

aggagctatc aaggtgaact tttcctaaga 
93840 

aatgtgtgca agaatagagt cagggagacc 
93900 

caaatcacac actaagcatt aagattgtct 
93960 

gggctccacc acttactggt attcattaat 
94020 

tgcgtgtggt ctatgtgcca gagactgtgc 
94080 



tgactcttac ctgccctgaa tggcattgga 
acaaacatgt ttttaaccaa gggatcagga 
catcacctct ctataactct aggctttccc 
gaatatgtaa atgtcctgct tgaaatgagt- 
aggtgaaaag tggaaatcgg aggaagggca 
tctgccactg cactctgccc aggcaaaaga 
gaagcgaggg cagggtgcaa aaggaacttg 
tctggcattt ctgggaaaag aaatgggcat 
tccagaatgg ggctagaagg aagtaaattc 
ggagaacttc tccttatcca aggtcgagga 
ttctcccaca cataccccca tggagataaa 
aaccaatggg attgaaacat ggaaatgaac 
agagcagtgt tggattaaca gggaatgtta 
cagcaggtac agtgcaaaga tgatcccact 
tgttgtggaa actcaagacc tctcaaagca 
ccacggaatt cagaaaacac cctacctggt 
ctcttctgac cctgtcctgg gagttaagaa 
tataacttaa acagacagac catcatcaaa 
gccagttact gtgctgtggg gttttggttc 
tgttaggtag gttttattat tgcactcatc 
ttcggtaatt tgtcaaaagc cctaaaacat 
aggcccaggc tcttaatcag taaatgatca 
tcggaaagaa caatggagaa tatgcttaac 
agcagagccg tgattctagc caagatgact 
aggaggttag acttgtttgg gaagggaaac 
gtagcccaat aatagtgctc gggagggagt 
agccaagtgt gtgcctcagc atccctagca 
ctgcagtgag aaaggcctgg gaccaaattt 
cattcatgca ttcattcaac aaatatatat 
tgggtgctgg caaagaacac agacaaggtt 
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94140 tCtCa tggagctttt attct gatga aggaaacaga ccacttacag ataaataaat 
94200 aQata aagggaaaca 9 atat Satgg agagtagctg gagggccaag cagaccgggc 

agacaaggtg gtggcatgta agctaagaca tttaaaaaga acctggtcat gagactatct 
94 260 

ggagaaggaa agctccaggc agaggaagca ggtagtgcag aggccctgag gcaggaatga 

94380 a9ata tttgagaaaa cagaacaaag gcaggcatga ccaggccgag tgggtggtgg 

944 a ? ta9ta gaaggtgagt gggggsgtgg gggcatcaag gtcaggcttt gcaggcttga 

94500 9ttCt CaCtgtggtt ct gg a gccag cagcatcaat gttacctggg aacttgttag 

9456? Caaat tCtcaggccc cacc< =agacc tgctgagtca caaactctgg gatggggcac 

«^o« tgt9t tttatc g a g c cctccagatg attccgagta tgctaaagtt tcagaattcc 
94 62 0 

taggttggat tatgcagttc aattttaatt ttaaatgcaa tgggaaccta tgaaagattt 
94680 " 

94?40 9999a gcagcatgtt ataa ttttct ttaaaaaatt gtttttaagc actcctgctg 
9480? a9aat ggaCCataac a gg ctaa gag aaatggaagc agggagataa attaggtggt 
94860 Caa9a ggCCaggtaa g aa g a g aaa g tggtttaagt agggtggtgt ggcagagaag 

^?^ tCCaa g ca g a ggggg accacgctga caaataagcg cgggccactc acgcaagccc 
94 920 

aacaaggcag aaggcagaag gcaaaagtga aggccagaga aaactggaca ccacctttcc 
94 980 

agagcacagt tcaaaggcaa tgtcctcaaa gaagacactc caccctcctc ccatttcctc 
9504 0 

95100 t9CCt aaaaataaga aggatacgcg gcctatggca aaccttgggc aggcacgtgg 

gagctgagct cttgcaaagg gcagatagtt cctctggtga gagagaaaag gaagggccag 
95160 

9 9 22? a9t9a aggaagagac g aa <= a gagag cccgaaaggc tgagaacgtt gtctggcttc 

oco aaa9gCt taa gggg fcta gctctggagg gtgaactaaa agccctagtt atattaaaca 
95280 

cacacgcaca cacgcacgca cacacatgcg cgcacacaca cacacacata cacacagttg 
95340 

oc? gagaCCt g ca g tttcca aaaacaagag ttgtattttt tttgttcata tcatgaccca 
95400 

95460 atCtC aaaagagaaa caatctcttg tcttccttgt ttaggcttag gagaacctgt 

9?520 9taa9 ° agCagcagc gg aa ctcaaa ctcgactctt cctactgtca ttctctctat 

o a c a « CaCaa g g catca g a g gaccactaga gtcgcctccc tagggttagg gttagggcaa 
95580 

ggtaaatgaa gtgagtcagc aagggcagga taggaacctg tctttattaa cattttgata 
95640 

9570? tttat Catggatttg tfc g ca ttaa-t tgcaactttt aaaaatcatt gcattaaaat 

o^^ tgatC tt: g attact g agtttttagg tgtaccctta aatgttgcac ctctgactta 
95760 

95820 CtCaC CCtgatGCCt g tcct ggatc tatgcctgtc tgttctatat cagcctcttg 

ctttgaccat aagaataact tcagaccttt aagcatagag gaaataggat ttctgtctcc 
95880 
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cttccccacc tttgtgataa tctcagcttc 
95940 

gcctactatg ttcctcccaa ggtcactagg 
96000 

cacattagca caaaggatcc actattcctg 
96060 

ctgactccaa cccttcaata gcctgggacc 
96120 

atttcccctt tgactgggag aaaaagggaa 
96180 

aattctcaca ggtggaaatt tctgacaacc 
96240 

aatggggaca gtagctaaca tgcaacctgt 
96300 

tgcacaccac cagggggcag cattctcatt 
96360 

tgtgcaacac tgcagctttg ctttagtgta 
96420 

tatagaatca gctaatctcc catgcagtta 
96480 

gcccagggac agcaatcttt cctggatttg 
96540 

ccagaaatat gggctttctc agagcttgag 
96600 

gttttgactg catttgaccc aagtcctgaa 
96660 

gaaggttttg tttggaccca gtgtttcacg 
96720 

cttggatcaa atttgttctc tcaaaataag 
96780 

gagatccaac atctctaaag ccatgaatca 
96840 

gggaacagat acactgttaa ataaaatgat 
96900 

aaggggattt atcaacttca gggcacagca 
96960 

atatatatat atttacttct cttgacttag 
97020 

cctgtctccc tcaccccagc ccccttgaag 
97080 

agctgtaagt aactgagtca catgcagaga 
9714 0 

gttgaagatt aaattcccct ttgagaataa 
97200 

aagaaggcca gatcctccct ctaatgtata 
97260 

gtcatacccc cactcacatg aatacacaca 
97320 

agggcagcaa agaaggaggg aaatctttat 
97380 

gaagggaagg gaagggtaag aggaagaatt 
97440 

gggcataatg aagataattg aagtgcagag 
97500 

ggcaggccag gcatggtagc tcatgcctgt 
97560 

cggatcacct gaggtcacga gtttgagact 
97 620 

gactaaaaat acaaaaatta gctgggttta 
97680 



tgcttttaaa 


gtctatctcc 


caagtagttt 


ttctgtgaaa 


ctagcagcag 


gctagattgt 


cagccgagct 


gggacaagca 


cttaggccca 


tacgttgtct 


ccaggtggta 


taaaacaaga 


gaactctaaa 


ttggaaaaca 


ggtcatctcg 


cctttgggac 


ccacaattca 


acacacccca 


aggctgttct 


gtcatccagt 


gccactgtgc 


ggcttctatg 


tgcctggagc 


ccagtgcagt 


gtccctgatg 


ggttcagtca 


agaaaatgtc 


agtctctaat 


tgaaatattt 


tctctgctca 


ctatttacaa 


ggatctctag 


aaattatcca 


tggacaggga 


attaaggtgg 


aaggcagggc 


gagccagctc 


ctctctcttc 


ctaattatta 


tgtatacaat 


acaaacttct 


ctcttttcta 


attcccagca 


gtgagagaag 


acaagacaga 


gataaccagc 


cacttgttct 


cttcagtgct 


tttatagatt 


cttctcactg 


cctttccaag 


atcatttatt 


cccagactac 


tggcatgcat 


aaaaaagaga 


gaattggagt 


tgtgaatatt 


tgagtcagga 


caaacttggg 


gcccaaatgg 


tgaaaccttc 


acagacccac 


tgatatggag 


ctgggtaaca 


ctcatacaga 


gactactttc 


gtgcaacgtt 


cctaaccctc 


agcccactcc 


taagcagtaa 


tataaagcac 


ttcccaccat 


tatggaagag 


tggaaggaag 


gaagggaagg 


ctcagggtga 


gcagaggaat 


gacatgtttg 


4— 4— 4— r*t 4- a 4- e~t o 

LLty Latyya 
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aaddLLtyaa 
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daiducagyL 


aatcccagca 


ctttgggagg 


ccaaagcagg 


agccgggcca 


acatggcaaa 


accccatctc 


gtggcgcatg 


cctgtaatcc 


cagctactcg 
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ggaggctgag gcaggagaat 
97740 

catgctacta cacttcagcc 
97800 

aaaagttcag gttgcagaga 
97860 

tagacattac agagagtttc 
97920 

gtaagaaagg aaggggagaa 
97980 

ctaccctaca aggcatgaaa 
9804 0 

taaaaaatta gaaattccct 
98100 

tttgaggttt ttggcactgc 
98160 

tgcagttacc tcagatgtcc 
98220 

ggaaagtggg taattaggaa 
98280 

cccagacatt tcagaattaa 
98340 

aacatgtcca gcagtcagct 
984 00 

taggggaagc cacatcctca 
98460 

agaagctgca tgcagcagag 
98520 

ggagtctgtg cacgttctca 
98580 

gtgtagggta gttccaagag 
98640 

ccttccatat tagttgtctg 
98700 

caatacccat ttattatctc 
98760 

gttctctgtc caagttctcg 
98820 

ctggaggctc tgaggacata 
98880 

ctcttgtggc ttgaggttgg 
98940 

ctttgcacct acaggctgcc 
99000 

agcagcatgt agaatctttc 
99060 

cagagaaagt tctttgcttt 
99120 

tccctatttt aaaggtaact 
99180 

tacaggttcc aaggattggg 
99240 

cccactcacc cataatcctt 
99300 

ttttgaataa gcacatttat 
99360 

atgtggcaaa ataggctgta 
99420 

aacacattgg gagaatctga 
99480 



catttgagcc 


tgggaggcaa 


tgggtgagag 


agctttcttt 


tggatggatg 


gatggatgga 


caattcttag 


gatgaattgg 


taaaattttg 


tgattttaaa 


accttaaaaa 


aaaaggcatc 


ctttttgccc 


tgacctttgg 


cttgcctgtg 


atcatatcct 


ctttccctct 


agccacaggt 


agcagaggag 


tacccatggg 


cagaatgagc 


atcaagtcct 


ctttactgtc 


agtagagaca 


gtaggttatc 


tctgatgagt 


ggctcaaagg 


agggtccaga 


cacgcacccc 


gaaacactct 


atcatttagc 


tcaggttcct 


ttgctgtgta 


gcaaattgtc 


gcaagttctg 


tatctcagaa 


tgagactgaa 


atcaaggtgt 


tacgcttcca 


accttattca 


aggtccccgt 


ttccttgctg- 


tatgttccta 


ttcacaagac 


ttgtggctcg 


tatctttctg 


taagcgttca 


tgcgattcaa 


gtgataccgt 


ataacatttc 


gcagaacatc 


tttgggggaa 


ttaaaaacca 


aatcttgaag 


acctaacttc 


atcagacacc 


aatcaatcag 


aactattctt 


cactgtcagt 


ggtataccag 
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aggttgcagt gagtcgagat 
tttttctctc acaaaaaaag 
tggatggatg gacggataga 
aatccttaag tctttattct 
atattttcta ccctgtagag 
tactttaaaa gaataatgtc 
gaaacagagt gagtgatcct 
gaaccctagg tccataatca 
aacacgctct ccaggcactg 
ctgtgatgcc cagttataaa 
caaatgggtc tacatccata 
aaatgttcct acactttccc 
ccagctagtc acaggtatgt 
atagatacca aagcaaaagg 
ttttgttcac aaaatagatg 
gcctccataa aataaataag 
agaaacgtag aggcttaaag 
gtccaggcag gcttgactgg 
tggccaggct gggatcttat 
ggccatcagc agaatcccgt 
gctgtcatcc agggaccact 
accgttcatc ttcaaaccaa 
gctttccctt cttctttagc 
tcaggcccac ctggataatg 
aggagtgata acagcacatt 
cattttagaa actctgcctc 
cctttttttc ccaaaggcct 
cactttgagc aaacactagc 
tcccaccaca atctttctca 
agcagactcc taccatctca 
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caagagctga ctgttaaatg tttagtaatt gtggacattg gttgttaaac tattagtagc 
9954 0 

ctgaaattga ctatagtgag agtattttca ccatggaaag caaccgttcc aaatcagggt 
99600 

ttctctttat tcctgggaag ctggtttatt agctcaccac tggctgtagt cctttagggg 
99660 

tcattacttg acctcctgta gcatgcagga atcctctcca tggccttttt tatgcatgga 
99720 

catcatccta ttttttaata ccaggaatgg ggtgatcact ctcttataag ctagttcatc 
99780 

tccctgatgg aatggtatgt ggtagagttg aaacccacct ccctggaact tcccaccaac 
9984 0 

ttcctttgga agcagcactt gtgacagccc cagaaccatt tggagtaagt agcatttcct 
99900 

ccaggagaca tctctcctct ggatccacaa atcaatagtt agatgcaaaa tctttagagc 
99960 

cacactgttt gaattcaatt cccagctctg ccacttattt agttataacc ttaggcaagt 
100020 

ctcttaactt ttctggtcct ctggttcttc atgtgtggga atggggataa aaatagcacc 
100080 

tacctcatag gttattatga atattaaatg agataatgtg tgcagagaaa atagcacctg 
100140 

gtctggcctc tacctatcta acaggttagt tgtgaggatt aaattactta atataagcaa 
100200 

aatgcttaga gctctgccta gcacaaaata agcactatgt aactattggt aagttaattt 
100260 

gaaatgtggt ttctagatct ctcatcatcc tagtcaccct actctggatg tactccaaag 
100320 

tccctctcaa gatatagtgt cagaattgac ctaattagtc cagcatttga ctgaaacgct 
100380 

agactttgac tccagceccc catccttgac tggcactagc attcaagccg cttctcctct 
1004 4 0 

ttccctgggt ctttaataga gtcagagcga cttctccagg ggatcttttg gccatggacc 
100500 

agtagcatcc acacacgctg gggccttgtt aaaaaggcag gctctcaggc cccaccccag 
100560 

atctactgaa tcagaatcca cacattaaca agatgcttgg gtgattcatg tgcacattaa 
100620 

agtttgagaa gcaccgcttt cagggacgag atgacacact tattttaaag agaacgccaa 
100680 

ttagagaccc taagccttct catggaacag gggccttccc ctcagacctt gggagagggg 

1007 40 

tcagggaaat atcagtgttg ggttgttggt gacaggtggc ggtggggggt tcagtccacg 
100800 

ttcaaagagc cagaaacctg gcaggggaag agatggggca gtgacaccca accggaaaaa 

1008 60 

taaaggaaac tacaagaaga acccagctaa gagatgtgag gcttctgaaa gctcccatgg 
100920 

aaaggttcgc agctcctcca cctgctcggt ccagctgccc caggtcaagg aagctctgtg 
100980 

agtgttagct gacccggagc agcaaggata cattcagaag tgatgaaagg gaacgcttct 
101040 

tgacagggta aagagtcatt cagtaggaat gagacaggaa gaggtcacag agtcagaagc 
101100 

ccagcctgta ctcagagatt atttctggca tgggagggcc gaagggttag gaggccacct 
101160 " ' ~" ~ 

actcacaata caatacagag gcagatccac ttattacctg cctgtgctgc tgggatttca 
101220 

gtgtggaaat tctgtgcctc ctcactgtgg ctgcagcttg ggaatgacat ccagagctta 
101280 
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cccacctgca 

101340 

tcagctgctt 

101400 

ctcatcttta 

101460 

ccagcacctt 

101520 

tggtgacacc 

101580 

gtagtcttag 

101640 

ctgtggtgag 

101700 

caaaaataaa 

101760 

aagaggcctg 

101820 

aggatcactt 

101880 

taccaataat 

101940 

tactcaagag 

102000 

atgatcacac 

102060 

ataagaagaa 

102120 

agcatatata 

102180 

taatcatctg 

102240 

tggctaaaga 

102300 

tcctcgatgc 

102360 

atcgtctcca 

102420 

caggtctgtt 

102480 

aaggcaggtg 

102540 

ggataatctg 

102600 

tggggggccc 

102660 

ctgttggaga 

102720 

aggtttgatg 

102780 

tgaaagtagc 

102840 

gtggattcat 

102900 

ttatgtttga 

102960 

agtcagatag 

103020 

cttgaattgg 

103080 



taagaaataa 
aatagctgtg 
aaatggaagt 
ggaaggccga 
tcgtctctag 
ctactcggta 
ccagattgtg 
taaataaaat 
acatggtgac 
gagcccagga 
aattttttaa 
gctgaggcag 
cactgcactc 
tgaaacaaga 
ttatcttaga 
ttcaactaat 
gaacccttcc 
aaaggactat 
ggttgtttga 
ctgggaacgc 
caggtcctgg 
tccccatgac 
aaaaaaatct 
gagcttcctt 
atactgggtc 
cttcaaaata 
tttacgaagt 
gggcgttctg 
acaagaactt 
agactgcctg 



gctataggtg 
cgactgagca 
aatcataatt 
ggccagtgga 
aaaaaataca 
ggctgaggtg 
ccactgcaat 
aataatatct 
ttatgcctgt 
gttcaagatc 
aaattagctg 
gaggacggcc 
cggcctgggt 
aagttcttct 
attcttcctt 
aaatgctgct 
cctttgcagc 
ttcaaaaggc 
tagaagatct 
tgtgtgcatt 
agcctctgac 
cattaagtaa 
gaaaaccctc 
ggagtagaca 
tgcagcttct 
tttttgtctc 
ctcaagtgtg 
tatgccaggc 
gcttgatctt 
gagatactgg 



taataggggg 
agttacttaa 
gaccaggccc 
ttgcttgagc 
aaaattagcc 
ggaagattat 
ctagcctgga 
atgttaataa 
aatcccagca 
agcctgggca 
ggcatggtgg 
tgagcacagg 
gacagagtga 
tatggttctc 
cctgtataaa 
tactcccact 
accctgagga 
cagccaggca 
cctgggagca 
ggcacctccc 
agcattactg 
ctgccactgt 
acttgaacca 
aatgtggtat 
ttgtcccact 
taatgacata 
ttagaaattc 
actgtgctgg 
ggaagtaagc 
atgctgcaag 
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acataggcta 
cctctttgag 
agtggctcac 
ccaagagttt 
aggcatggtg 
atgagcccgg 
gacagagtga 
agcagaaata 
ctttgggagg 
acttagtgag 
catgcacccg 
agttgaggct 
gaccctgtct 
atggtggtga 
gaaggcctcc 
ttcactctaa 
tcagaggcct 
gcccagacat 
ggtttccgca 
ttggcagaaa 
gctctaggag 
gcgggaagaa 
gtaagttata 
gttaagtaaa 
gaaaatcctc 
tttttgctgc 
accatgagtc 
gcactgggac 
agggtggggt 
cttttgaaaa 



aaatcctagc 
catctgtttt 
acctataatc 
gagaccagca 
gcaggtgcct 
gaggttgagg 
gactgtgtct 
agaatgaaat 
tcaaggtgag 
gtcccatctc 
tggccccagc 
gcagtgagtc 
caataaataa 
gcacaatgta 
tccaatgtat 
aggaactcaa 
gatttgaatg 
gtatttccta 
gcagctcagc 
gcttggagga 
tagctgctca 
gaactggaaa 
ccctgggttg 
ctggggatct 
gggcattcca 
aaaaagatga 
actcagcaag 
tactgtagca 
ctggttagtc 
aagacaagtt 
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ctctgtactt 

103140 

gactgaaagt 

103200 

agaggtgata 

103260 

agcgtcccag 

103320 

gaggactggg 

103380 

ggtctgaggg 

103440 

tgtgtgtact 

103500 

cctggctata 

103560 

tcttaggagg 

103620 

ttgttttgtt 

103680 

acaatctcgg 

103740 

tcccaagtag 

103800 

gtagagacag 

103860 

ccacccgcct 

103920 

atgatggtgt 

103980 

gctggggaag 

104040 

gaaaactggg 

104100 

ggaaagaaaa 

104160 

ttgagcagaa 

104220 

caggcaaggt 

104280 

gagaatccta 

104340 

taacctaagt 

104400 

agatgtcaat 

104460 

cagagcttgt 

104520 

ccttaacatt 

104580 

acctttcatg 

104640 

ttattgagtt 

104700 

tgactctcaa 

104760 

tgaaactttt 

104820 

agaacatcac 

104880 



gcagagctta 
ggtggagcca 
agtttactca 
gcagagggga 
aagaccagtg 
gt cagcagcg 
acaattacag 
a tgtggggaa 
ctactqcaaa 
ttgctt t?*g 

CtCACt^.T'«ia 

ct gna v* ac 
agt 1 1 jcci'ti 
cggcct r era 
1 1 1 g a t c t g *j 
agagg.7a^gc 
caatttgtta 
ctcaagat ct 
gtaaggact. t 
aataccataa 
taccctgcct 
tactcttgcc 
caaaacagca 
tgctgatggg 
ccaaccccct 
acctgatttc 
tccctagttt 
accaccaaac 
aatagggaac 
aaagtaggtg 



catccagtaa 
ggagtcctct 
gagaegcaaa 
tagcaagggg 
tgtctaggac 
actggatcat 
gatgeatgat 
gggattgaag 
geccaaggga 
aaggagtct'c 
r~r -cgcctc 
aq : :at ::cac 
tgr r j jtcag 
aaa:j rtggg 
<jt : jaagg 
ctq^cjcigtga 
gatqatggtg 
ggctggtgac 
tctccagtgt 
gtcccaacag 
cctccattca 
actagttaac 
cgagccatgc 
ggcacagact 
ctaatagccc 
tatttccagc 
tctgaattaa 
tgggattgtt 
acaaaccctg 
ctattatttg 



ctaactaact 
agataaggta 
cgatcaggat 
attgecctta 
acataagcaa 
gcaagctccc 
aggacctggg 
aaagagggca 
gaggtgatgg 
actctgtcgc 
ttgggttcaa 
caccatggct 
gctggtcttg 
attacaggtg 
cagaaggaag 
ggaaagaatg 
ccattgacag 
ttgtattaaa 
tcaagagctg 
tgatgccctc 
ttccttgctc 
gcctgtcctt 
tatgtcacat 
agattttgag 
atttaggatt 
ttcaaccacc 
tgactgaaga 
gttactctta 
cccagaaatt 
accttacacg 



aacttcaggc 
gccatggaag 
aagcacagac 
ggtgggaaag 
ggggaggacg 
ataggecatg 
ctgcattttt 
aaggcaggaa 
tgttttgttg 
ccaggctgga 
gcaattctcc 
ggctaatttt 
agctcctgac 
tgaggcaccg 
gggggtagta 
aggggtgatt 
aaatgggaaa 
ettaaagect 
gaagggattt 
cctgggaatg 
tgatggtggt 
atttctcttg 
gacatgttgt 
agaaatctct 
tatcatactg 
ccttgggtca 
tgataagctt 
gtgataatgg 
catataaatt 
tgagacttga 



tgtgttgagt 

gcctctccga 

cccggtgaag 

ggcttgattt- 

ttatgaacga 

gtaagggctc 

aatagttaac 

caggaaaatc 

ttgttgttgt 

gtgcaatggc 

tgcctcagtc 

tgtattttta 

etcaagegat 

cgctggccaa 

aattaactgt 

ccaggtttag 

gaacaagttt 

catttgtgac 

ttctagcctc 

atctcaatgg 

tctggctggc 

tccccaccta 

ctgtccagcc 

ctgttaccac 

tttcatccaa 

ccacctgtac 

cccttacata 

ttgetattta 

atttcattta 

agaactttag 
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agcattgccc aaggtcaccc agctagtgag 
104940 

ctgtctccat tacctggaag aaggaaggcc 
105000 

gccacgggct ttctacggta gccagccacg 
105060 

tgatgagggt ttggagttca tgtggtgggg 
105120 

gttgcattca cttcaggagc aaaggaccag 
105180 

actgcaggct ctgtagacat ccaaagctct 
105240 

gggaggttgt acccatttag aatgtaaaga 
105300 

actgaaggcc tcagtctcct ccaacaaagc 
105360 

tgcaggccca tggccactgc ccagggctcc 
105420 

gggagaccca gttccacacc cacccaccta 
105480 

aaacatcttc cctgggaaat ggtcccaaaa 
105540 

gcccaggtct acatctgttt ttaaagctcc 
105600 

gaagaagaaa tgcagaacag ggtgactaaa 
105660 

acaaactaac accttttaac atgaaaagca 
105720 

aggactgaga aaggaatcgt gattctgaga 
105780 

atccctgtga ctgcagaagc ctgtttggag 
1058 4 0 

gaattgccag aatgtgtgac atgagccaaa 
105900 

accacccccg acccaaccct cccgccggct 
105960 

agagaagttc tactatttct gggcaagatt 
106020 

catgaaataa accatttgga tcaacgtcag 
106080 

agcaagactg ataaggaagc aaacatggga 
106140 

gttatttcat tgctctcata actgccctgg 
106200 

ttatcagtta aattatgtat ccaagattac 
106260 

ctcggctgtg aagcttgagc tctttctttt 
106320 

agcacaaagc ccaggaggag cccagtgagg 
106380 

ccgtgcccct gtggtgttgg ggcaaacttg 
106440 

ctcttcccgt ttttgtctgc tggctgactt 
106500 

ttatttttcc atccacagca aaacaattca 
106560 

gaaaaacaag aattacttca ttgagatgat 
106620 

gaaaggtgta gtaccctgtc ctccacacta 
106680 



gggtggaggc gggatttgaa tccagctcat 
agagcatcat ggcctttcac aagttgaaga 
cttttccatg actggggtgg gtgtggcaag 
tggcagggac caggtgtctt ggtaactgct- 
atctgattct gcaggatcaa caatatggac 
aatggtgact tggggaagct caggagggca 
ttcctatttt ataaaaaaga aaaaaaggag 
caggctgtgg ggtagcagag tctcaaaggg 
tgctcaggcc tcctcactcc cacaactgag 
gcagtgtctc acacccaccg ggagaggtct 
tgtccctgca gtaagcaacc atctggagag 
aataaataaa taaatgaagg aagaaaaaaa 
attggcatgt atttttaaat gtttatatta 
atataattgt gctagccaca aaatcatcgt 
gccctagagt taatgtgatc cagctggctc 
atagtgtcag tagcttttca ggccctctgt 
tttcccccca gcatccccgc cgccgccacc 
cccatagaat agtcactgcc atacagaaaa 
tccacaaacc agtttgtccc tttctgcttt 
ctgattgcaa aaattttccc ttgtctcaaa 
ggaccttagt ggccgagcct ttatgtgtat 
gatgctgtaa gcatgattca tcctgtttgt 
acagcctatc caggattaga actcagagcc 
cagtcttcaa atatgatcat gccatgaagc 
ctggaggggt ccactggcag ccactctcct 
gatctttctg aatcttttaa ctgtttcctt 
gtcctacact ctactccttg cttatgatac 
catcaaggta attgatgatg aggcatatga 
gggcccccgc atggtggata tgagttttca 
acactaacat tcttctctcc tcttctgttt 



63 

3DOC1D: <WO 0233086A2 I > 



WO 02/33086 



PCT/US01/32152 



cttcctctcc aacccatttg tctcctcctc 
106740 

cccttgtctc ctctcttgct ctctctcctg 
106800 

ctctctctgc ccccagctct gtcctaacac 
106860 

gggatgctca agaccgatgg taattgttct 
106920 

gagccaaaat gctggagtat catgtgcggc 
106980 

aggggggaaa agggaccaca tagcccagca 
107040 

ctttcagcac aaatttccat ggggcagtta 
107100 

gttgagaatc aatgtatttg gattacagtt 
107160 

ccaccttatc tgccatgttg ctactatcaa 
107220 

ctgaaagtac aggtttgagt tccagatcta. 
107280 

ttcctattaa gaacacccag gtgtggaggc 
107340 

tgacccaaat ggaagcttcc ccaagaggtc 
107400 

tgccttcaac ttcaatgacc ccatacatcc 
107460 

ctttcctgaa tagatcatac tgtggagcag 
107520 

tccccttcca agctgtccct catgccctcc 
107580 

atcccatccc acagcagagt tcctgcagca 
107640 

ctggggtccc atatctaggg ccaacagaaa 
107700 

atgagctttc tcttttgtcc gctgagcaag 
107760 

aaaggtaata aaaatgtaca gtcgtgcatc 
107820 

gtgtccttaa gcaattttgt catcgtggga 
107880 

atggtgtagc ctacaacaca cctggactat 
107940 

gtacagcatg tgcttgtact gaatattgca 
108000 

ctaaacacat ctagacatag aaaaggcaca 
108060 

gaccactatt gtagatgtgg tctgtcattg 
108120 

tacttgtaaa gtacacacac cacaaatgca 
108180 

gggtcagtct actacattat aaatggcaaa 
108240 

catcagagga tctgaagaga catttaggta 
108300 

cttaggcctc ccctccccca atctcccccc 
108360 

aagcagaaaa ttacaattct ggctcactag 
108420 

tgaaggacca gaagaaacca tataggaaga 
108480 



ctcttgtctt ccacctctct ggttcccttt 
ctctcttttc actcctccct ctcctctgtc 
ctgcca'gcct gacacatggc atccatacga 
gggataagga aatgagtatg gggaaagaaa 
tcttggcttc tccagaatgg ctgggcataa 
ccagacagaa gagcagcact gagaaacagg 
ttctcagggc taaacttaga gtcccaggaa 
cattcccctc ccaaaagcag gctttaggag 
gacttgtttc tcctcctgac cttgaggaag 
ggtcaaatat ccatttgtct tcctatgttt 
agagagttag aatagtggtg gagatcatcc 
catggggctt ctcagagtgg atggaatctt 
catggcctcc aatagacaag tcaagaagtc 
ggagctgcca gtactgaggg caatgttcct 
agtacatgcc tgttgtcaca gagcacccca 
gagaaacagg ctcacacctt gtagacagcc 
tattcccaaa aaaatgcctc ttgacaatca 
gtataaaaag atgtcaaaag aagtacccaa 
acttagcaat aaggatacat tctgaggaag 
aaattataga gtgtactttc acaaacctag 
gtgggcctat tgctcctagg ctacaaacct 
ggcaactgta gcacaatggt atttgtgtat 
gtaaaaatat cgtagtatat agccttatgg 
agcaaaacgt ttttatgtag catgtgactg 
cagcaagtcc tgtgccctac aagccccttt 
gccgagcacg cccacagaag gtagcaggaa 
aatgctcttt accctttaga gcatttagtt 
cgccccccgc caaaaagaaa aagaaaaaga 
taggacctgc tagccaccat tgtgattcca 
atcaggccca cacggcaacc tctccacatg 
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acaaagagcc agtctttgga gggcagtgaa tttcaaggaa agttttcttc cctgqqtaac 
108540 

ttgtttttaa aagatgttat gttttgttga gatacccaga gatgaacaga aacttccatc 
108 600 

accttgtgcc ccagacccat gataattcac attgaggaaa ccagttttgg aacacatcac 
108660 

ccctaagtga tagaagccca aaggtgattt agaatttgat gatttacatc attttcttca 
108720 

cattttccca gaaatgcatc agctgtaaat agtaaaggat tcctatgtaa tattqtqqtt 
108780 

aatacatatt tattttagtt cccaccactg aagccctatg agataaagaa tgagaaaqat 
108840 ~ ~ * 

cacacaattc tacctccctt tcttctctct ctctctctct ttctctttct ctctcactct 
108900 

ctctctctct ctctcttctc ttcctctgtc tggttttcct tcctcataaa tacttttctt 
108960 

ttaaaatttt ctrtctgaaa ctcacaatgg aagtgagtat agacataaag aaggqacaca 
109020 

agccctgggt tctg-.taaca tattccctgt tgtgggaaga ccctgggtta ttcccagtgg 
109080 ~ " "* ~ " y ^ 

gttagtagtt ta^ctgt t .;c ccu-iaqaaat gccactgtta tcatgtgaca cccagtqqaa 
109140 * ^ 

tgtgctgcct gactc.»:?t: cta-taactg ttggcaaggt ctaaaatgac tcctcctcac 
109200 

cattacccgc cttctqc — ct.cetcccct tctgtccttc tggctccctt cctttgccca 
109260 

cctttccttg cctcctaart ccctaccccc tcacccgtaa gaacaactat gaccaagaag 
109320 ~ 

acaagaaaaa ctaag<i.:cat ttattacctg agaacaacac aatccaccat gqtcctqtta 
109380 yy y y 

109440 aCCa tggtg9:?clCt a^^^tgcatg tgccaggaat gacggggaat gattttaaag 

gctgtgctcc aggtgaccaa ccaatctacc gacccagtcg acacactctc tctcttqtta 
109500 ^ ^ 

tccctacagg aaaaccacaa gggttaaaat agtagatgag gaggaatacg aaaggcaaga 
1095 60 

gaatttcttc attgcccttg gtgaaccgaa atggatggaa cgtggaatat caggtqtqag 
109620 ~ "* ^ 

attctttaaa aacaaaacaa caaaaaaaaa gaaagaaaaa ttaaaacaaa ctgaaaaaca 
109680 

acaacaaaaa agaaaaagca gctatatttt tgtctccctc cttttcttcc cttctcctcc 
109740 

tttctctttt tgaccaatgg atttttttat tcttttccct cctgtattct cqctctcacc 
109800 

ctgtttcggt atcatctctg ccttcttagc cttagcttat tccaaattcc tcctttaccq ' 
109860 y 

ccttctgggc agcactgcag cctcaactcc tcattaccct aatgagttat ttccctqttt 
109920 

tgctacaatt ttcaattatt caattgccat gggcccctgc actctccccc accccacccc 
109980 

tacactgtaa cctgtaaatg tgaaaattcc ttggtgggtg gggaggagaa gaaaaaaaag 
110040 

gaatgtgatg cgatgcatgc -ctgtgcccct tcctgccttc ctcccctgcc acccctcact 
110100 

ctttagcctg gattgaatgt gggggggtct gggatggggg ttggggcctg ggttgcaatg 
1101 60 

atgctttgac agttttctgc tgcattcccc aacttccttt gaacgcttgg caqgttattc 
110220 

11028o" ag t9gCCCatag 9 cccctc tgc ccttcgagga ggtaagtgta ttttctggct 
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gtttcacagt 

110340 

tctgagagct 

110400 

gctattct tc 

110460 

gggtgcctca 

110520 

cctgaaggga 

110580 

ggactgacat 

110640 

aaotgatgca 

110700 

cct t tqcttg 

110760 

cactagasaa 

11OS20 

aggggct cat 

110880 

ct tccct tga 

110940 

catagatgtg 

111000 

ca ttcaggaa 

111060 

tttgtcaaaa 

111120 

taaaactttc 

111180 

tctacaggcc 

111240 

gagctgacca 

111300 

tcctctcctt 

111360 

ctgactgcca 

111420 

gtattcttgc 

111480 

cctgccctac 

111540 

gaagcagcag 

111600 

tacttagatg 

111660 

agacaagagg 

111720 

tctcttatgc 

111780 

tcttcaagag 

111840 

cctgttttgc 

111900 

ggtgagaaga 

111960 

gaccgagatg 

112020 

tttattgtct 

112080 



tgggcagacc 
ctgagagaga 
ctcctctggg 
ttcaggctac 
aggaaatagt 
tgggcaggcc 
gttggtgctg 
gcatctgcaa 
qtggcctcat 
ccctaaagga 
agaaaatgat 
ggaatgggtt 
agatatgctg 
aggggtgatc 
cacacgagga 
ttctcagtgt 
cttctgtgcc 
cttcactgct 
aaaagtgctg 
ccaacttcca 
ctcattccca 
atacccaagg 
caaacgaaag 
aagagattgg 
ctcccttaaa 
acctaaccaa 
acagattctt 
caaggatttc 
ccaagatgaa- 
atgttttcta 



gtggcatggg 
gagttgactt 
catcccaccc 
tttgggacaa 
tccaggtatg 
tggagtgttg 
atccctgaca 
gagaaagtac 
ctcctgcttt 
ctgattatgt 
tggaagccac 
agggtgctcc 
ctgggagcct 
atactgcaat 
cagagctaag 
gattggtcat 
tgcgctggtt 
ggttctcttc 
gaagaaagaa 
gttcttgcca 
cctcccttga 
ccaattgtaa 
cagcacatgt 
aagtccatac 
acctctacca 
atgcaaatgt 
tgcccaagga 
agcaagctat 
gagtgccaag 
ggcggttaat 



aaagtgtacc 
ctggggtaat 
catgccattc 
tgcaacctct 
aaaattcccg 
acttgtggat 
gacaggtgtt 
cgcccagatc 
cctcaggacc 
tgcaaccagg 
tactttgaat 
tgggaaataa 
cctagcaaat 
tagttcatat 
agattcagca 
ttctcattgt 
tggacacacc 
gtcaccactc 
attatctctg 
ccaacaatat 
gcatttattc 
gtcaccttca 
acagcgtaca 
atgcctttat 
acagcaggac 
ctttggctaa 
aatgtcagca 
ctgtgtggtg 
aagaaattgg 
ttccagtttc 
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aattgtcaga 
catgcaatct 
tatgttccta 
aaagcagaaa 
tagccagggg 
tttccaacag 
ggaaaggtca 
ccaagatagc 
tgcatctgag 
gtagaagtaa 
ggcttccaat 
caagaggacg 
gaagcagtga 
tcatgtgaca 
acaacattcc 
ctgctgggga 
tgatgctcta 
aataaaacgt 
gttctattgt 
tctcagaggt 
catgtattca 
tcagtttcca 
ggaaggaagg 
tccaccagta 
agagagtgac 
tccccattta 
atgccctcgt 
tgcccccaga 
tcaattttcc 
ttcagtactt 



agccacggct 

ggaattctga 

gcccaaggtt 

attgagagtt- 

ccccagaaaa 

aagagactct 

cagatgtctg 

cctcatccca 

aatacctgcc 

ggaaggattt 

catttggagg 

ttcacactcc 

aatccacctg 

aagagcagca 

caaaggattc 

ctctcctgca 

ggggcagaac 

tgccctcagc 

ttcccacatt 

tgcctcagca 

taattggttg 

cagtccaagc 

cagtggttcc 

aaaaggctct 

cc'aagataag 

aggacatctt 

ggagggagta 

tctccccagt 

agctgcctat 

cccgtatttt 
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gacattagac cataaggtga aaggtcataa aacctgattg tctagactca gaagcaaatq 
11214 0 

gaaacccatc caaatttcca gaattccctg ctgttctcag agtgagaaac aqaacaqtqa 
112200 " y yy 

aaattgcttt tcattatcac tactgcatgg gagagtctga aacattcaga atggcataqt 
112260 ' " y 

ctttgcatgg tcaaaatgac aattgcatta aaaaaatgag agactggatt tgaaataqqa- 
112320 yy . 

gactctattt ttggcaaaca aaacagactt cagagttgag attaaaagct ctggatqaqc 
112380 * * y 

tgggggatgg aaaaaaggga aggaaaaaag ggagactgaa taggaaacac agttgctctg 
1124 4 0 

gagtctagaa gtggacttcc gagagcaaca ctgagcaaca taatcaagac tqttqqacct 
112500 yyy 

gggcctggac attggaagcc ttcggataga aaggaaagct ctctgtctct ctctctctct 
112560 

ctgaagaatg gggcctgttt ggtcctcctt tttcgacaac cgtgggctca tcttgacaaq 

112620 y 

ctgcccagat gcttcctaat tactcacagt cctatgctct ttccagcttg tccctggqqt 
112680 ' y 

gtctgagcag gaataaatga ctctcacctg acccagggga tcaatacaqq qqaaaqttca 
112740 . - - .. 

gctccagctt ctctcatgag cagcagcagg aaaaacaccc tcgaggtatt qtqtcaqtca 
112800 y 

112860 9CCt aCCCaggtct tgctgaccca tctataactg ctgagcagaa agtcttggat 

tcatggagac aatgaceaga gaatgatgga attccagcca actgcaggcc ttctcactac 
112920 

tctagggatg ggccagatgt tcggtggcat gtatgagtga aaaccagggc atcagggacc 
112980 

tttctggaag agctgccttt gtctgaccca cctgtgttca tttatgtgct gggatctctq 

113040 " y 

^ctcccctg gaacttgggg gaagctcttc cacgcaaact cccggaagga gcagaataaa 

caagctcttg cctatctatc tatctatcta tctatctatc tatctatcta tctatctacc 
113160 

tatctgccta tctatatcta tctatctcaa tgtagtgagg aaagccattg atccattaac 
113220 

ctttggaatt ctacatggga gatacctaaa aaagtgaact gccttgttta tgtatcatqc 
113280 " 

agactctgga tccacatata tctcagtggc tgtgaatata ggatgattga tcacaggcct 
11334 0 

gagttgcatt cctacagatt cttaggaaaa aaattgattc acagacatgt cccccctqqt 
113400 " 

tcccccacaa cacacactcc ttcctcagca atctctatca gtcaccaact acacqttqaa 
113460 y 

tatgtggcaa gctcttccca gacctttatc tgagagccaa ggagtgaggg gctgtactaa 
113520 

gatatcatag aaatgaaaat gtggtgtgtc acaagtttcc ttaattctta gatcttaaac 
113580 

tctaagaggg ttcagcataa gtacaaattc aagggctaga gacaacctgt attggqtqtq 
113640 ' ~ ' y 

tctttaactc agtttcccaa tccacatagg gaccttgcat ttgtcatctc tcatctatat 
113700 ~ y 

atagctgttg gtatgacagt ttctctgttc cagaatacct gaactctgac ttagcctqtc 
113760 ' 

ctttctgaaa cagaaaaatc acccaaccag agatctatga gatctatgga aaaqacaqtt 
113820 

gccaaaatag acagcaaaca gccaaactta attgaacact accacatgca gggactttgc 
113880 
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taagcagagg tgatacaaaa tgggaggagc 
113940 

ggtaaagaca aaccattcaa ggaaaacatt 
114000 

cttttagggg aaatcaaagt tctagtcaac 
114060 

gaaaccttat gagcactgag aagctgagca 
114120 

acatagacct gggcactgtt ccactcttgc 
114180 

tcaagcccca tggtcatttt tgccactcat 
114240 

acacagtttt catgataact ggattctcac 
114300 

tggtatccta cacagtcatg agtgaccaac 
114 360 

ccattcctag ccctagggta gccgggaaag 
114420 

ggtgctcagc agcctctccg gctgctcaca 
114480 

aggctgcttt ttgcgtatgg ctgctgagct 
114540 

gctgccagca accacaaaac ctggtgatac 
114600 

tttctttctt ccatttttct ggtatccatt 
114660 

agaggtgtct tgtccccctc ttttctttcc 
114720 

gaccaatgtc ctttgattta ttgaataaga 
114780 

tcccaaggac tgggagattt attttcccac 
114840 

gaaggcaatt ccttgagcat cacttctgtc 
114900 

tctctgggac caggaagaga aattaagctt 
114960 

ctgagtctta attcatctgc catctcatgt 
115020 

tggaatctcc taatataatt gtgacaggat 
115080 

tccaggaaat aaccaaacct gttgtgagaa 
115140 

ttttaccagt cctggagatc ttccatctca 
115200 

tggttcattt gtcattgacc cttaaagtcc 
115260 

ggggattggg gcagtgttct ggctggaaat 
115320 

ttctcatgca gaatctaggt gctatagagg 
115380 

tgagtggaaa ccaatcataa acaacattca 
115440 

gcatatctag caatactata actctttgtg 
115500 

aatatatttt aaaagaacgt cttacatttc 
115560 

aaggccactt aagaggcaaa acatcttaca 
115620 

gctacatctt gggttcagga aggggcggtg 
115680 
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ccatagccct 


aacttccagg 


atatatctac 


ctgcaggact 


tacctttttg 


ctaagtcatt 


gtggcagcta 


ggaaggcatt 


tgtggtgatg 


tgagttcagc 


taagtcgtta 


gggatggaag. 


acaatgctac 


ccatttcctt 


gagctcccat 


aagttagcta 


ctctggcagg 


gttgcaactt 


tccttttttt 


acagaatgga 


tgtgataacc 


ctacccattt 


ggttccccat 


cctcattcct 


cataggagca 


aatgccctta 


ccagggccct 


cctcttgctg 


ctgctctgtg 


catgctccaa 


ctcacctact 


aagctctctg 


ctttccttat 


tttcaagatg 


ggacattaat 


gctctttcct 


tgcaaacagc 


gctcctgtta 


tctccaggta 


acttcttgcc 


agtgccatta 


tttggtttaa 


actgcaggct 


caagttaacc 


tgacaatttc 


atgaagcaat 


tatgagaaag 


caattgtgag 


tggggacgtg 


ggttaaggca 


tagctgatcc 


aacaaggaga 


tggtgggtca 


tagacttctc 


tgtgggggaa 


gagacagtga 


gattcagagc 


ttgaaaaaaa 


aatactttaa 


tcccaaggga 


taggaaatgc 


aatttttaaa 


gaatctggaa 


tcacagctga 


gacttaaatt 


gctagaattt 


tatgtgccgt 


gaacaagatg 


aattaggatg 


ataaatttta 


gagaatttat 


tttgaagaga 


acgtacacct 


actttgagag 


tatgcttgca 


acttcatgag 


cagatatgaa 


agcattttca 


caagcagagt 


ggcctacaca 


agacagtttc 


atcagtcctt 


tgaacacaga 


aaaaaatgtt 


gagttcattg 


atattcaaag 


tcacctacag 


tacatagtaa 


ggacatacgc 


cttctgggag 
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115740 a ° aa acaaaaaaaa tgtaggtaac tcctacattt ttcttttgtg gaaaaaacac 

agttactcca gcttccttgg ctttttgctt cttttttata ccaacaaaat aagggctatc 
115800 . yyy 

115860 CCtC tgttCttCat tcttctccca gggtattgat ttcataacat tgggtttttc 

ttctctactt cactcatcct cttgcctgtg aaggtatgta aggcttcttt gttccaactc 
115920 

tttcctccac ccgccccccc tcacataaat gcataacaaa gattgtgatt taatttaaqt 
115980 

116O40 CtaC ttttaacata tt:t 9caaaca tcaatagaag ctaaaatggg aaaaaggaaa 

^tttctttt cctagctctt tcaatctgta agcctttaat ttaggagcgc tgattagcct 

ttcaattcgt tggaaatctc aaatactggt tttaattttc ctaggtggac agagacagag 

^aatatgtt cattctgagc taaccacccc cccaccccca agctcaggcg ccttgcagga 

agagcactag ctacatcact ctgcagagtg ttcacaacat cctattcttg tctggcctqq 
116280 

caagctcttt gtccttccaa tatttgttca atcttccatc ctattcatat tctatctttc 
116340 

116400 CtCC CagCCtCtct tcct 9 ttc ct agaactgaga gtttatttag tcagtctgaa 

tatctagatc acctgccatt tattctcttt acttgaaatt ctgaggagtc acataaacaa 
1164 60 

gatatcagaa tcactatggt cctctaaatt gaagacttat aattctctca agaaattaac 
116520 

aacatttgaa tttaaaggaa agatcatgac aaaaatagaa aaaggcagga attattgcca 
1 16580 

aaccgagaaa ctagaaacta gaattaactt aaaggcatgt gactcaatca attaacaaat 
11664 0 

atatacagag agcctctgtg ggactgtggg agatccaaag atagaggatt ggttatttgt 
116700 

116760 9att tttgCagaaa 9 cta gatgga aaaactgact gtcaccacag aggtggacag 
116820 3a9t agatCaatat cct 9ccagat ggatatagtg ctagattgat aggtagacaa 

ggggttagac aggtacattt atatgtcact ggagagctca ttatattggt ataaagttat 
11688 0 

116940 aCat gtaaagtatg acatggggga attggggagg aaggagtgga ataatactgt 

117000 taa9 ataggCattg tgatatggtg cttaaacctg caagtaaagg aaaagagtat 

11706() t9t9 tgtctttttc taa gggcttt ttcccagagt agcttgcagt ctggcttcta 

117120 Ct " CCtatagcca g aacc ctaga ttcacccaga tttaccttca gaattaacta 

117180 9aCt CaaattCaat a g actaaa tg aagtcaggct gctagaggat gtctgctgac 

117240 atat gcagaaagac at gg atcct t gagaaaacat tgtttccaaa agtggccacc 

agcactagag gaaggacagc accacggaca gctcccagac attttaggat tgccttctqt 
117300 ' 

gtttggtgcc cgaacactga gcaaaacagc gaactcagga agtctccaca cactctcata 
117360 

117420 tCat gcagtCCaac taa g aaaa tt cttacataaa atataaggct gtctgcttgg 
117480 aaaC CCttggCtta ta g tctt ttc agtgaatttc tttccttgca aactcgagag 
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ttggagtctc acgactgccc ttgcttcacc aattccccag ctagagacaa aagaccttct 
117540 

tggcctctga cccattttgt ccttgagatt atccaaggac tacaggattc ccctaggaqg 
117 600 " 
tttactgtgt ggaatgaaag caattaagga gctgaataaa agaaataatt gcatgtgaqa 
117660 

atgtggactt ggatgggaag atgtttaaat gagctctgaa agaaacaagc tgccaagagc 
117720 

aattttctaa ttaaagggga ataaaaagat tcaatctcta tttcactcta atccagaaaa 
117780 

catgtcttca tggagaagtg ctcttaaaat ggactcatca gccaaagtgg aaaaacaaaa 
117840 

aacaaaaaaa ctgttcaaca tgagaaggga ccattggtaa atgagtcaag atgctgtgaa 
117 900 ~ " ~ ~ 

accagtagac atttcctttg aataaatgta cttctgcacc ttcaagaact cttacaggaa 
117960 

gtggttgaac aaacaggccc aaaagttcaa aatagttcaa ggtcaaaaca cttgcccttt 
118020 

cttcccagtt ccccaacatc tcactgagtg tcttgagaac ttcacttgat gctatttctc 
118080 

aggagatgtt taggtcaggt tgtccaccca ggtataaaag agaaagagga acgcttatcc 
118140 

cagtctgcaa ggcacattct catggtctgg ttataaagtg tttagtactt cataaaaaag 
118200 

gcactaaaaa tatatataaa ctccccattc ccaagagtta tttgctttgt acccactgcc 
118260 

catgcctaat actctgagct gtatccttcc agggaatgga aaaggtgtta aagcgagtct 
118320 

gattttgttt tgttgcagat gtgacagaca ggaagctgac tatggaagaa gaggaggcca 
118380 

agaggatagc agagatggga aagccagtat tgggtgaaca ccccaaacta gaagtcatca 
118440 

ttgaagagtc ctatgagttc aaggtcaggc aaacagtgag gtctaattga ataataaata 
118500 

aattaaagtg ggaggcagaa gacctggggt gtttttttcc actttcacta gtgaatatgt 
118560 

gaagttgaaa ctgaacaaat cacttaccca ccccaggtct cagtttcccc atttgtaaca 
118620 

tgaaacaaat agtgctgacc atttgtatgc taggaatatt gttaggaaac ataatataga 
118680 

atgtgaaata agtggactag aaagtcctga gatgtattat cattattgtt taactgtgtt 
118740 

tttaaagcaa aaatattaaa actcactact acagggcaag atatattaac atcattatta 
118800 

ttattcatta ttgtattatt ctaaatagcc aatttcaaaa gtcacaacca ggccaggcaq 
118860 

tgagggactc acgcctgtaa tctcagcact ttgagaggcc gagatggaag ggtcacttat 
118920 

acctaggaat ttgagaccag cctgggcaac atagggagac tccatctcta taaaaaataa 
118980 

aacaaaataa aaatcagctc agtgtggttg tacatgcctg tggtcccagc tactcaggag 
119040 "~ 
gctgaggtgg gaggatggct tgagcccagg aggttgaggt tgcaatgagc catgattgca 
119100 

ccactgcact ccagcctggg tgacaaagtg agaccctgtc tcaaacaaaa caaaacaaaa 
119160 

agattacaac caaaaacaaa gggaaataga aggattgcct caaaagagat cgcccaaggc 
119220 

cattccatgc gtaactgtca gaacaccttg ' gagacagggc atctttcatt cctttgaaga 
119280 
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accagactcc tcattggttc tgagcattct aacctcatgg ttccaagttt ttctcttctt ~- 
119340 

aacagactac ggtggacaaa ctgatcaaga agacaaacct ggccttggtt gtggggaccc 
119400 

attcctggag ggaccagttc atggaggcca tcaccgtcag tgcaggtgag aagtgtctca 
119460 

ggctggcctt gctgggagaa gcaggcaacc tctgagaagg aagcgtaaag ccacgttaac 
119520 

agcctgccag tccctaggaa ggcttgtgtg ttcagtcttc ccagctctgg tcctaggtgc 
119580 

ctgcttggaa aagaatcatg gcgtatctga aaaacatggt tatctctggt ttcaaatcgt 
11964 0 

tgttctgctg tgtgaactgg aacaatgtac cctctctgac ctcaatgtcc tctttccaaa 
119700 

ggggaactat tgctaccttt ctcagaaaag tagaaaggta cagagtcttg tataaaatcc 
1197 60 

aaactcaata aattctgatt tctgtcattc tttcttttca tgggtttggt cccgctcttc 
119820 

tgtaaaatgt gggacaattc tgatttagag atgtgggagt taggagttta taaaatgtgt 
119880 

tgcattgact ctccaacaaa acactctgga tgattccata cccctccctc ggcatttact 
11994 0 

gacaggctcc ctcagtagtg acccacagca cagccgggag tcctagcagc ctgaggggac 
120000 

tgctggttgg aacagggacg gaaaaggtct cccaaccacc atcactatca cctctcagca 
120060 

ccactgaggc ctcctggcct tgtcttttat tgagagactt tgttgtcata gcaacccaca 
120120 

gggtcatatc cccaaggccc cagagccaga gcaaaaagac agccaggaag agaggtttgc 
120180 

tgctgctgct gctgctgcta ccccactttt ctcatcacct gctttagatc tttctagctc 
120240 

cccctctgat gacctgactg tgcccctcaa gacaataaac ggaatgtagg ccacatcatc 
120300 

taccctgctc cttttacaaa ggaggggact gaggttcaga aataagagat gatttacccc 
120360 

agcttacaga ttttcttcat ggcaaagctg gaatgagaac ccaagtgttc tgactcctgt 
120420 

tctttcaaaa cccagcttct accggttatg ccaaaacatg acagaagttg ccgttggcaa 
1204 8 0 

ggcacaggca tgcctcagca taccctcccc tccagggctg ctgagtgggc aactctgccc 
12054 0 

acatttcctg gcaaggacaa tcaaggccca tcctgctttt tcccatgaga tgtttggagg 
120600 ~ ~ 

agggcactgg ctctgcagta tattctcgtg atctggaatg acagccatcc ctcaggggac 
120660 

agataatgac cagaaccaca atggttattg cagcagtcag gtcagaaaat ttgagaggag 
120720 

ccctgctggc atccagtgaa gagtggccac accgaactga tttcacttct ctccttagac 
120780 

aacaaaatgc agcctgtgca ttctcctttc tttttttttt taattatact ttaagttctg 
120840 

gggtacatgt gcagaacata gagttttgtt acataggtat acacgtgcca tggcggtttg 
120900 

ctgcacccat caacccgtca tctacattag gtatttctcc taatgctatc cctcccctat 
120960 

ccctcacccc tgacaggctc cagtgtgtga tgttcctctc cctgtgtcca tgtgttctca 
121020 

ttgttcaact cccacttatg agtgagaaca tgcagtgttt ggttttctgt tcttgtgtta 
121080 
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gtttgctgag aatgatggtt 
121140 

taatctcctt ctttcttctc 
121200 

tgccctgcat tcaaattgac 
121260 

acgctggccc tggcccccac 
121320 

gatgcctgtg gtgcgggttg 
121380 

gttctcttgc ctgtttatct 
121440 

ggagaggctg ccctcctgct 
121500 

gctgtttgcc tgtgtgcccc 
121560 

catcctcatc attggcatgc 
121620 

caccattggt ctcaaagatt 
121680 

cccaggtgag agtgagaggt 
121740 

ccctggactc catctcatta 
121800 

gcccttaaag tgcacaaaag 
121860 

aatatttctt gagcactttc 
121920 

tacattttcc ccctgtaatg 
121980 

acttcacaga caaagaaact 
122040 

ccaatttctt aacctccaca 
122100 

agatattgtt ctccttctat 
122160 

tcaaagatat caaaggagaa 
122220 

actgtgagta caaagtatct 
122280 

ttttatagga tgaattacat 
122340 

aggagcggac attcctttac 
122400 

acagtattgt gctagttttt 
122460 

cacgtggagt ttataatcta 
122520 

ttgttaaatc ctaggaagaa 
122580 

tgtaggagtt agcaggtaga 
122640 

aggggctttg gggcctctga 
122700 

cagaaagaaa cagagcacat 
122760 

cttttctggc tgggccaagg 
122820 

gtagctgaat tgacatcaaa 
122880 



tgcatcctcc tttctttctg 
ttccttattc cctggccctc 
ctatgaggca gcccaaattg 
cagctgccca gaagacagct 
ggcttgactt tctcatgaat 
tgccttcagc aggggatgag 
ttgactacgt catgcacttc 
ccacagagta ctgccacggc 
tcaccgccat cattggggac 
cagtcacagc tgttgttttc 
gcttgaattt gcaaagagga 
tcttccacac catctcagat 
tcaatcaaag agatgaataa 
aatgtgacag acaccatgtg 
tttcccaaag ccctattaaa 
gaggcccaca gaggttaagc 
ttatgtgagt agaccacaaa 
ttacctctgg cgatctctga 
atgcccacat acattcttgg 
cctagcagga cagccaaagg 
actctttctt tttcttagga 
tcattgaaca aatatttact 
gggactatag tgaaaggcaa 
ctgaaggagg caactctcaa 
aaagaaaagg tactgcatac 
caggggctgc actagcccag 
actctgaact tctgccaggg 
cctcctgcag ggaagttagg 
gaaatctttc ctgtaccaag 
ttctaggaaa acctctttcc 
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ctccactgtc ttgtccctct 
tctctcccac tctaccttgg 
tttccccact attttctggc 
ggagtcccct tctagcggat- 
gattatctga cttcttaccc 
gatgaggatg aatccgggga 
ctgactgtct tctggaaggt 
tgggcctgct tcgccgtctc 
ctggcctcgc acttcggctg 
gtggcatttg gcacctctgt 
ttttacctgg ttcaaatgac 
ctgaacttaa cagagcctct 
tgacattagt aatgacagct 
tgttcagcaa tttacacatt 
tagggtaagt tattatcccc 
tacatgccca agtaagtggt 
cagtgaaatt aaaagaatgt 
gaggttaaag attagccagc 
cctcctctac ttggaaggac 
aagttccaca gcttttatct 
acactcagag acaaacagaa 
gagcacctat tatgcctgtt 
gatacacatg cttccttctc 
ctactgtaat taaagttatc 
ggaaggaagt tgggcctgaa 
gttctttact taattcagtt 
agctggcatc ccagttgccc 
ctgaatctca tcagacagga 
caaacatatc cttcaagaga 
aaaaccccag cgcaggccag 



_0233086A2_I_> 



WO 02/33086 



PCT/US01/32152 



cggtattatt tgtccattag tgatgcaaga gatttagcta tcgtggaaat gcatcagaag 
12294 0 

gttggaaatt agatggatga tccca.ggaag gcctgtggat gagatgccct gtgatctctg 
123000 

ttctccaagc cttgggggac ctgaactatc agaggggagg gaggaaatat gggggaaagc 
1230 60 

atagaggtgg gaagaaatat cagaggatca gaagcaaaaa acaacaataa caacagaaac- 
123120 

aaaaacaaac aaacaaacaa aaaaacaagg ccataggcaa gaaagggtaa gaggttttct 
12 3180 

ctgggagatc taaaaaaaat ggcaataatg aggtaagcca ggcagatacc tttgggcatc 
123240 

tccaagtcct tgcaattggc caagacaaca gctaacaaca tttgaggctt taagaaggtt 
123300 

accctgtgat ccactcatct gatttagtgg ctttggctga agctctttgg atatagttga 
123360 

aggtacggaa agggtcctta catgaggact ttagggtcaa gtctcttgct aacatcctat 
123420 

gtgaccttgg gtaaattctt tgacccttat ttttcttacc tgtaaaataa aagaattggg 
123480 

ctagatgtct ctgacagtcc tccctgtatc tacaatctgt gccaagatct aaagtcaaac 
12354 0 

accctgcaag gccctgtgat acatatataa accacaaaga cagagccccg tcttccttga 
123600 

gtccacagtt caccctgcat gtccccatca tggttcccca acatgtcctc tgtccccaaa 
123660 

atccagcacc tcacccagtg ctcaatcagt aggcattgct caataactgt tggtggttcg 
123720 

tgaataaatg ccccatatga cagttaaaat caggcatcta ctccaagcag cttcccaggg 
123780 

tgtcaaggtt ccctggggag atattatggg atggcaaact tcccttactg aaaaagtagt 
123840 

caaaggagaa caataagccc actcagtaaa tatcagaact ggaaagccct tcagaatctt 
123900 

tcagatcact gcagatgagg aatgggaagc ccagactagg gatgtgacct acccagggcc 
123960 

acacggcttg cttgcggcag aactaggagt taggagtggc cccctagccc ttgtctctca 
124020 

ttcctgggtt cagcccacca gctcaagctg ctttttgggc atactggaag acaagccctg 
124080 

cacaccttag cctcctacca gttcccatgt gtctttgtcc ttttccagat acgtttgcca 
124140 

gcaaagctgc tgccctccag gatgtatatg cagacgcctc cattggcaac gtgacgggca 
124200 

gcaacgccgt caatgtcttc ctgggcatcg gcctggcctg gtccgtggcc gccatctact 
124260 

gggctctgca gggacaggag ttccacgtgt cggccggcac actggccttc tccgtcaccc 
124320 

tcttcaccat ctttgcattt gtctgcatca gcgtgctctt gtaccgaagg cggccgcacc 
124380 

tgggagggga gcttggtggc ccccgtggct gcaagctcgc cacaacatgg ctctttgtga 
124440 

gcctgtggct cctctacata ctctttgcca cactagaggc ctattgctac atcaaggggt 
124500 

tctaagccac acaacagagc ctccagcagg gcaggcctag gacttctcct aagagaaggg 
124 560 

cacttcccca ccagtgatct ctcccgactg cactgccctg gagaggcagc atcaggacct 
124 620 

aagccccagg aacttcaccc aacttaggcc ctggcaatta actgaaaggg caaagtctta 
124 680 
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atcaatcaaa 

124740 

caacagcaac 

124800 

gtcagagcct 

124860 

ttggaagcag 

124920 

gacaagacgt 

124980 

atcattattg 

125040 

tttttttctg 

125100 

gagaaacatg 

125160 

gttccccatc 

125220 

taagctcact 

125280 

ggagcttttt 

125340 

ccaagccttt 

125400 

aggaagaccc 

125460 

ggctactgga 

125520 

ggcaggagag 

125580 

tgtcaatgtg 

125640 

aagagggctg 

125700 

aggggctccc 

125760 

aggaactgga 

125820 

tttttccagc 

125880 

cttggagcct 

125940 

atgtgggaag 

126000 

tgtcatcacg 

126060 

taaggggggg 

126120 

aaaaataatt 

126180 

aaatgttggg 

126240 

tgcttctacc 

126300 

gagacgtgac 

126360 

ccagtatatt 

126420 

cgaagttgga 

126480 



caatggagga 
aaatccacct 
ttcgcctcct 
ggtttctcct 
gaaaatgagt 
tggttgcttt 
aagtgagtga 
aatttttaga 
tcactgtttt 
cacccacgca 
atgcttatgt 
ccaattctgt 
gccaaataga 
agaacaaaag 
aagagatgtc 
cctgagaaat 
actatatttc 
ttcacctctc 
a t tec teat t 
ctatggggta 
gggtttttgg 
gacctggctt 
aaactgccca 
ggggaagaaa 
aaaaaagaaa 
ttggttcaaa 
tgtgtttctt 
tcaggtgaag 
ctaagcaaaa 
agatgtaagt 



atcaccgact 
ccaccccatc 
tctattccat 
ctctgcccaa 
cacattcatg 
cgttttgccg 
aaaaggtgcc 
cacatgctct 
ctctgagttc 
gggtactaaa 
tgaatcaagt 
gtgtcttgtt 
agagatggga 
ggatacaggc 
caaccagtga 
gtcagcttca 
ttgaagtggt 
ctggtccctc 
tctcccatgt 
gttcttgctc 
agaaagagat 
taaaacacgc 
cggatcttta 
ttaaaaaaca 
cagtgctcat 
aaaaaaataa 
cagtgaaagg 
gtgtcacctc 
atacttcagg 
agtagtaaaa 



ttacacagta 
tccccctcat 
cttttgatta 
ttccatatgt 
tggctggggt 
ttaggttttg 
acaaaggaat 
cctgccacct 
ttgggaggag 
gatcttacag 
cagatacaaa 
gtgtcagtgt 
caaaaatagg 
cttcttgatt 
gatctttaag 
ccacgctctt 
gtaaaaactt 
aggtcacata 
tcctgcttgt 
cagtaagagg 
ctccgtgccc 
acacaaacac 
ggctttctgc 
cctgttaatt 
gtcataagct 
aagctatact 
tccagggggc 
ctctcacact 
aaaatgccac 
gctgtggtaa 

74 



tttaattgaa 
atccctgacc 
ttcctttgcc 
ccctattatc 
ggggttcttt 
cttattattt 
tccaggtccg 
cttggctcca 
tgatggtgtt 
gagcttcaac 
aagcaattgt 
ccacttgtgt 
aatggtgtgt 
atctttggct 
agaaaagttt 
gcttcctaat 
agagatttta 
tgaaagcatt 
tcttaaactt 
aatcttagtt 
tacagacctt 
acaaataaac 
attgacataa 
ttaaacacat 
atgttgacag 
atatctctct 
cactgtgggc 
caggtgccaa 
ttgtcaaaac 
ttatggagga 



tacaaacaag 
caaagcaaag 
tctcatttct 
tcactcagct 
tttcattgta 
tgttttgtct 
agccaacaga 
tcaagatcca 

ggggtagaaa 

tggagcagga 
ccctctttgc 
atccttctgc 
gacgacaaag 
ttgtacctga 
gtattttaaa 
gctctataca 
taagagaacc 
tttacaagat 
catgaagcta 
gtcataatcc 
ttctcaacga 
agacataaga 
atacattttc 
tttttaagaa 
ttgccagtgg 
acatacagct 
ttcttgtgag 
tgtgtcagac 
ctggactttg 
aggaggtttc 
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tgtatcagaa aggcattggc cgtgacagac tc 
126512 



<210> 4 

<211> 927 

<212> PRT 

<213> Rat 

<400> 4 

Met Ala Trp Leu Arg Leu Gin Pro Leu Thr Ser Ala Phe Leu His Phe 

15 10 15 

Gly Leu Val Thr Phe Val Leu Phe Leu Asn Gly Leu Arg Ala Glu Ala 

20 25 30 

Gly Asp Leu Arg Asp Val Pro Ser Ala Gly Gin Asn Asn Glu Ser Cys 

35 40 45 

Ser Gly Ser Ser Asp Cys Lys Glu Gly Val lie Leu Pro He Trp Tvr 

50 55 60 

Pro Glu Asn Pro Ser Leu Gly Asp Lys He Ala Arg Val He Val Tvr 
65 70 75 80 

Phe Val Ala Leu He Tyr Met Phe Leu Gly Val Ser He He Ala Asp 

85 90 95 

Arg Phe Met Ala Ser He Glu Val He Thr Ser Gin Glu Arg Glu Val 

100 105 HO 

Thr. He Lys Lys Pro Asn Gly Glu Thr Ser Thr Thr Thr He Arg Val 

115 120 125 

Trp Asn Glu Thr Val Ser Asn Leu Thr Leu Met Ala Leu Gly Ser Ser 

130 135 140 

Ala Pro Glu He Leu Leu Ser Leu He Glu Val Cys Gly His Glv Phe 
X f 5 150 155 • 160 

He Ala Gly Asp Leu Gly Pro Ser Thr He Val Gly Ser Ala Ala Phe 

165 170 175 

Asn Met Phe He He He Gly He Cys Val Tyr Val He Pro Asp Gly 

180 185 190 

Glu Thr Arg Lys He Lys His Leu Arg Val Phe Phe Val Thr Ala Ala 

195 200 205 

Trp Ser Val Phe Ala Tyr He Trp Leu Tyr Met lie Leu Ala Val Phe 

210 215 220 

Ser Pro Gly Val Val Gin Val Trp Glu Gly Leu Leu Thr Leu Phe Phe 
225 230 235 240 

Phe Pro Val Cys Val Leu Leu Ala Trp Val Ala Asp Lys Arg Leu Leu 

245 250 255 

Phe Tyr Lys Tyr Met His Lys Arg Tyr Arg Thr Asp Lys His Arg Gly 

260 265 ' 270 

He He He Glu Thr Glu Gly Glu His Pro Lys Gly He Glu Met Asp 

275 280 285 

Gly Lys Met Met Asn Ser His Phe Leu Asp Gly Asn Leu He Pro Leu 

290 295 300 

Glu Gly Lys Glu Val Asp Glu Ser Arg Arg Glu Met He Arg He Leu 
305 310 315 320 

Lys Asp Leu Lys Gin Lys His Pro Glu Lys Asp Leu Asp Gin Leu Val 

325 330 ' 335 

Glu Met Ala Asn Tyr Tyr Ala Leu Ser His Gin Gin Lys Ser Arg Ala 

340 345 350 

Phe Tyr Arg He Gin Ala Thr Arg Met Met Thr Gly Ala Gly Asn He 

355 360 365 

Leu Lys Lys His Ala Ala Glu Gin Ala Lys Lys Thr Ala Ser Met Ser 

370 375 380 

Glu Val His Thr Asp Glu Pro Glu Asp Phe Ala Ser Lys Val Phe Phe 
385 390 395 400 
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Asp 


Pro 


Cys 


Ser 


Tyr 


Gin 


Cys 


Leu 










405 








Thr 


Val 


Val 


Arg 


Lys 


Gly 


Gly Asp 








420 










Tyr 


Lys 


Thr 


Glu 


Asp 


Gly 


Ser 


Ala 






435 










440 


Thr 


Glu 


Gly 


Thr 


Val 


Val 


Leu 


Lys 




450 










455 




Ser 


Val 


Glv 


He 


He 


Asp 


Asp 


Asp 


465 










470 






Phe 


Val 


Arg 


Leu 


Ser 


Asn 


Val 


Arg 










485 








Gly 


Met 


Thr 


Pro 


Ala 


lie 


Leu 


Asn 








500 










Leu 


Ala 


Ser 


Pro 


Cys 


Val 


Ala 


4. li J. 






515 










{- \j 


Ala 


Gly 


He 


Ph<? 


ZUr 


Phe 


Gl u 


_y o 




530 










5 35 




He 


Glv 


Val 


M--r 


3 » u 


V/i 1 




V^ 1 


545 










r c 






Thr 


Val 


He 


\ \i i 


i * o 

r , t ( 


F'f »•* 


*•* l i 


Thr 


Gly 


Glu 


Asp 


Ph. 


C : j 




Tf.r 


Tyr 








Sfc'J 










Glu 


Thr 


Val 


Lyc 


TM 


r i *? 


At cj 


Val 






595 










600 


Glu 


Arq 


Gin 


Gl u 


Ar.r. 




?ht_- 


I le 




610 










t 1 t 




Glu 


Arq 


Gly 


He- 


r,er 


Ala 




Le u 


625 
















Lvs 


Leu 


Thr 


Met 


c : u 

o*l 5 


Glu 


Glu 


Glu 


Lvs 


Pro 


Val 


Leu 


C 1 v 


Glu 


His 


P Y~ O 

r i. u 








660 










Ser 


Tvr 


Glu 


Phe 


Lvs 


Ser 


Thr 
i in. 


v a J. 






675 










680 


Leu 


Ala 


Leu 


Val 




G 1 v 


•"!*h r 
* 1 1 1 


Ui c 

nio 




690 








695 




Ala 


He 


Thr 


Val 


Ser 


Ala 


Ala 


Gl w 


705 










710 




Glu 


Glu 


Arg 


Leu 


Pro 


Ser 


cys 


tJ its 










725 






Val 


Phe 


Trp 


Lvs 


Val 


Leu 


Phe 


Ala 








740 










His 


Gly 


Trp 


Ala 


Cvs 


Phe 


Val 


V Q JL 






755 










760 


Thr 


Ala 


He 


He 


Glv 


Asp 


Leu 


Ala 




770 










775 




Leu 


Lys 


Asp 


Ser 


Val 


Thr 


Ala 


Val 


785 










790 






Val 


Pro 


Asp 


Thr 


Phe 


Ala 


Ser 


Lys 










805 






Ala 


Asp 


Ala 


Ser 


He 


Glv 


Asn 


Val 








820 










Phe 


Leu 


Gly 


He 


Gly 


Leu 


Ala 


Trp 






835 










840 


Met 


Gin 


Gly 


Gin 


Glu 


Phe 


His 


Val 




850 










855 




Val 


Thr 


Leu 


Phe 


Thr 


He 


Phe 


Ala 


865 










870 






Tyr 


Arg 


Arg 


Arg 


Pro 


His 


Leu 


Gly 



Glu 


Asn 


Cys 


Gly Ala 


Val 


Leu 


Leu 




410 










415 




He 


Ser 


Lys 


Thr 


Met 


Tvr 


Val 


Asp 


425 










430 






Asn 


Ala 


Gly 


Ala 


Asp 


Tvr 

jr 


Glu 


Phe 










445 








Pro 


Gly Glu 


Thr 


Gin 


Lys 


Glu 


Phe 








4 60 










He 


tilt: 


G 1 ii 

V31U 


Glu 


Asp 


Glu 


His 


Phe 






475 










*3 O \J 


Val 


gi ii 


Glu 


Gl n 

VJ JL UL 


Gin 


Leu 


Glu 


Gin 














4 95 




Ser 




Pro 


T.^n 

J-J" UL 


Pro Arg 


Ala 


Val 


505 










510 






Val 


i nr 


He 


-Lie Ul 


Asp 


Asp 


Asp 


His 










525 








Asp 


i nr 


He 


His 


Val 


Ser 


Glu 


Ser 








540 










Leu 


Arg 


Thr 


Ser 


Gly Ala 


Arg 


Gly 






555 










560 


Val 




Gly 


Thr 


Ala 


Lys 


Gly 


Gly 




570 










575 




Gly 


gi ii 

V3J.U 


Leu Glu 


Phe 


Lys 


Asn 


Asp 


585 










590 






Lvs 


He 


Val 


Asp 


Glu 


Glu 


Glu 


Tvr 










605 








Ala 


Leu 


Gly 


Glu 


Pro 


Lys 


T rn 


Met 








620 










Leu 


Ser 


Pro 


Glu 


Val 


Thr 


Asp 


Arg 






635 










640 


Ala 


Lys 


Arg 


He 


Ala 


Glu 


Met 


Gly 




650 










u j j 




Lvs 


Leu 


Glu 


Val 


He 


He 


Glu 


Glu 


665 










670 






Asp 


Lys 


Leu 


He 


Lys 


Lys 


Thr 


.fio 1 1 










685 








Ser 


Trp Arg 


Asp 


Gin 


Phe 


Met 


Glu 








700 










Asp 


Glu 


Glu 


Glu Asp Glu 


Ser 


Gly 






715 










ion 


Asp 


Tyr 


Val 


Met 


His 


Phe 


Leu 


Thr 




730 














Cys 


Val 


Pro 


Pro 


Thr 


Glu 


Tvr 


Cys 


745 










750 






Ser 


He 


Leu 


He 


He 


Gly 


Met 


Leu 










765 








Ser 


His 


Phe 


Gly 


Cys 


Thr 


He 


Gly 








780 










Val 


Phe 


Val 


Ala 


Phe 


Gly 


Thr 


Ser 






795 










800 


Ala 


Ala 


Ala 


Leu 


Gin 


Asp 


Val 


Tvr 




810 










815 




Thr 


Gly 


Ser 


Asn 


Ala 


Val 


Asn 


Val 


825 










830 






Ser 


Val 


Ala 


Ala 


He 


Tyr 


Trp 


Ala 










845 








Ser Ala Gly Thr Leu Ala 


Phe 


Ser 








860 








* 


Phe 


Val 


Cys 


Leu 


Ser 


Val 


Leu 


Leu 






875 










880 


Gly Glu Leu 


Gly Gly 


Pro 


Arg 


Gly 
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885 890 895 "" ~" 

Cys Lys Leu Ala Thr Thr Trp Leu Phe Val Ser Leu Trp Leu Leu Tyr 

900 905 910 

Val Leu Phe Ala Thr Leu Glu Ala Tyr Cys Tyr lie Lys Gly Phe 
915 920 ' " 925 
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